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Abstract 

Several aspects of Dirac reduction arc compared and formulated from the same geometric 
point of view. A link with nonholonomic reduction is found. The theory of optimal momentum 
maps and reduction is extended from the category of Poisson manifolds to that of closed Dirac 
manifolds. An optimal reduction method for a class of nonholonomic systems is formulated. 
Several examples arc studied in detail. 
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1 Introduction 



The equations of motion of nonholonomic mechanica l systems and th ose in circuit theory can be geometri- 
cally described using a Dirac str ucture (introduced b y Courant J 1990m) in tak i ng either a Hamiltonian or La- 

grang ia n point of view (see, e.g., Blankenstem ( 2000l ). lBlankenstein and Ratiu ( 20041 ). Blankenstein and van der Schaft 



( 200lh . lYoshimura and Marsden 



(|2006allbl . l2007lV ) 



Dirac structures simultaneously generalize symplectic and Poisson structures and also form the correct 
setting for the description of implicit Hamiltonian and Lagrangian systems usually appearing as systems 
of algebraic-differential equations. In symplectic and Poisson geometry, as well as geometric mechanics, a 
major role is played by the reduction method since it creates, under suitable hypotheses or in categories 
weaker than smooth manifolds, new spaces with the same type of motion equations on them. Briefly put, 
it is a method that eliminates variables and hence yields systems on smaller dimensional manifolds. Due to 
the spectacular array of applications, reduction has been extensively studied in various settings, including 
that of Dirac manifolds. The present paper continues these investigations, connects Dirac and nonholonomic 
reduction, introduces optimal reduction, and presents several classical examples in the different settings 
considered in the rest of the paper. 

A Dirac structure D on a manifold M is a subbundle of the Pontryagin bundle TM © T*M which is 
Lagran gian relative to the canonical symmetric pairing on it. Dirac structures were introduced by I Courant 
(1990b) to provide a geometric framework for the study of constrained mechanical systems. The easiest 
example of a Dirac structure is the graph of a 2-form u> £ ft 2 (M). Closed or integrablc Dirac structures have 
an additional integrability condition. They have been more intensively studied because they generalize, in 
a certain sense, Poisson structures. For example, if the Dirac structure is the graph of w e il 2 (M), then it 
is integrable if and only if dui = 0. Other examples of integrable Dirac structures include various foliated 
manifolds. In general, a closed Dirac structure determines a singular foliation on M whose leaves carry a 
natural induced presymplectic structure. 

In this paper we study several aspects of Dirac reduction. First we recall the necessary background on 
Dirac geometry in Sj2j We begin our investigations with the comparison of two different descriptions of Dirac 
reduction by symmetry groups in Sj3] It is known that under certain assumptions beyond the usual ones, 
the quotient manifold carries a natural Dirac structure. These hypotheses are formu lated in the literature 
in two different manners: using sections (see iBlankenstein and van der Schaft (2001)) or appealing to the 
theory of fiber bundles (see Bursztvn et al. ( 20071 )). While each approach has its advantages and both lead 
to the same result, it turns out that the method using sections needs an additional technical hypothesis, 
discussed in detail in the appendix lAl We show in §4] that Dirac reduction as pr e sented in [JH coincides with 
the method of reduction for nonholonomic systems due to lBates and Sniatvcki (1199.1 ). This is achieved by 
reformulating their Hamiltonian approach to nonholonomic systems in the context of Dirac structures. 

The second aspect o f reduction studied in_j5] and $6] is the extension of the optimal point reduction for 
Poisson manifolds (see lOrtega and Ratiul (|2004l )) to symmetric closed Dirac manifolds. The Dirac optimal 
reduction theorem has as corollary the stratification in presymplectic leaves of a closed Dirac manifold. 
The reduction is carried out in two steps. First, one restricts the Dirac structure to the leaves of an 
appropriately chosen distribution jointly defined by the symmetry group and the Dirac structure. The 
leaves of this generalized distribution are the level sets of the optimal momentum map. Second, one passes 
to the quotient and constructs on it the reduced Dirac structure. It is not possible to extend this result in 
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a naive manner to non-closed Dirac structures because the first consequence of non-closedness is the non- 
integrability of the distribution used in the previously described reduction process. However, under certain 
integrability assumptions imposed on another distribution, it is possible to extend the ideas in Marsden- 
Weinstein reduction to nonholonomic systems. This is achieved in iJ7] These integrability conditions are 
certainly strong since they imply that the nonholonomic Noether 1-forms that descend to the quotient are 
exact. This is not true in general but holds in the case of certain systems such as the vertical rolling disk 
or the constrained particle. In order to present this nonh o lonom ic re duction method , we re formulate the 
nonh olonomic No ether Theorem (see iBates and Sniatvckil (|1993T ). §6, ICushman et al.l ((1995), Theorem 2, 
and iBlochl ((20030) 

on the Hamiltonian side and give a n explanation for certain constants of motion that 
sometimes appear as a consequence of this theorem fsee iFasso et al. ( 20071) ). 

Conventions. Throughout the paper M is a paracompact manifold, that is, it is Hausdorff and every open 
covering admits a locally finite refinement. The orientation preserving rotation group SO(2) of the plane R 2 
is also denoted by S 1 and consists of matrices of the form 



cos a 
sin a 



— sin a 
cos a 



a g 



2 Generalities on Dirac structures 



This section briefly summarizes the key facts from the theory of Dirac manifolds needed in the rest of the 
paper. It also establishes notation, terminology, and con ventions, si nce the s e are not uniform in the literature. 
The p r oofs of the statements b elow c an be found in ICourant] ( 1990b ) , Blankenstein and van der Schaft 
(120011 ). lBla"nkenstein and Ratiul ((200l . iBursztvn et all |2007l ). 

Throughout this paper we shall use the following notation, li E ^ M is a smooth fiber bundle over a 
manifold M, the spaces of smooth global and local sections are denoted by r g i b a i(-E) and F(E), respectively. 
For example, X(M) := T(TM) denotes the Lie algebra of smooth local vector fields endowed with the usual 
Jacobi-Lie bracket [X, Y](f) = X[Y[f}] - Y[X[f]}, where X, Y G X(M), / is a smooth (possibly only locally 
defined) function on M, and X[f] := £xf = df(X) denotes the Lie derivative of / in the direction X. If 
A fc (M) — > M denotes the vector bundle of exterior fc-forms on M then £l k (M) := T(A k (M)) is the space of 
local /c-forms on the manifold M. 



2.1 Dirac structures 

For a smooth manifold M denote by (•, •} the duality pairing between the cotangent bundle T*M and 
the tangent bundle TM or fi 1 (M) and X(M). The Pontryagin bundle TM © T*M is endowed with a 
nondegenerate symmetric fiberwise bilinear form of signature (dim M, dim M) given by 

((^mi OLrn)i i^mi 

) (i) 

for all u m , v m € T m M and a m , j3 m G T^M. A Dirac structure (see Courant (1990b)) on M is a Lagrangian 
subbundle D C TM © T*M, that is, D coincides with its orthogonal relative to (pj and so its fibers are 
necessarily dim M-dimensional. 

The space T(TM © T*M) of local sections of the Pontryagin bundle is also endowed with a K-bilinear 
skew-symmetric bracket (which does not satisfy the Jacobi identity) given by 

[(X, a), (Y, (3)} : = ([X, Y], £ x (3 - £ Y a + U (a(Y) - (3(X)) 
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= ( [X,Y},£ x [3-i Y da-U((X,a),(Y,[3)) 



(2) 



(see lCouran3(|l990bl )). The Dirac structure is closed or integrable if [T(D),T(D)] CT(D). Since ((X, a), (Y, [3)) 
if (X,a),(Y,/3) G r(.D), closedness of the Dirac structure is often expressed in the literature rela- 
tive to a non-skew-symmetric bracket that differs from ([2]) by eliminating in the second line the third 
term of the second component. This truncated expressio n which satisfie s the J a cobi identity but is n o 



longer skew-symmetric is c al led the Courant bracket (see iBursztvn et al.1 (|2007| ). iBursztyn et al 
Bursztvn and Crainid (|2005f ) . ILiu et al.l (|l997t ). ISevera and Weinsteinl (|200lh V 



(2004). 



1 A somewhat restricted version of the momentum equation was given in lKozlov and Kolcsnikov (1978|); see also lArnol'd et all 
(1988) 
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2.2 Distributions 



We wil l need a f ew standard facts from t he theory of generalized dist ributions on a smooth manifo l d M (see 
Stefan (Il974allbl I 1980), Sussmann (Il973h for the or iginal articles and lLibermann and Marld (|1987l ). IVaismanl 



j200lii . or lOrtega and Ratiul (|2004h . for a quick review of this theory) 



A generalized distribution A on M is a subset of the tangent bundle TM such that A(m) :— A n T m M is 
a vector subspace of T m M . The number dim A(m) is called the rank of A at m G M . A local differ entiable 
section of A is a smooth vector field X G %{M) defined on some open subset U C M such that X(u) G A(m) 
for each u £ U. In keeping with our previous notations, T(A) (respectively r g i b a i(A)) denotes the space of 
local (respectively global) sections of A. A generalized distribution is said to be differentiable or smooth if 
for every point m G M and every vector v G A(m), there is a differentiable section X G T(A) defined on an 
open neighborhood U of m such that X(m) = v. The distribution A is locally finite if for each point m G M 
there exists a neighborhood U of m and smooth vector fields Ai , . . . , A& defined on U such that for all m! 
in we have 

A(m') =span{Xi(m / ),...,A^(m / )}. 

Note that a locally finite distribution is necessarily smooth. 

The term distribution is usually synonymous to that of a vector subbundle of TM . Since we shall work 
mostly with generalized distributions, we shall call below all generalized distributions simply distributions. 
If the generalized distribution happens to be a vector subbundle we shall always state this fact explicitly. 

In all that follows, A is a smooth distribution. An integral manifold of A is an injectively immersed 
connected manifold ll '■ L > M, where lt, is the inclusion, satisfying the condition T m LL(T m L) C A(m) 
for every m G L. The integral manifold L is of maximal dimension atmSl if T m iL{T m L) = A(m). The 
distribution A is completely integrable if for every m G M there is an integral manifold L of A, m G L, 
everywhere of maximal dimension. The distribution A is involutive if it is invariant under the (local) flows 
associated to differentiable sections of A. The distribution A is algebraically involutive if for any two smooth 
vector fields defined on an open set of M which take values in A, their bracket also takes values in A. Clearly 
involutive distributions are algebraically involutive and the converse is true if the distribution is a subbundle. 
The analog of the Frobenius theorem (which deals only with vector subbundles of TM) for distributions is 
known as the Stefan-Sussmann Theorem. Its statement is the same except that one needs the distribution to 
be involutive and not just algebraically involutive: A is completely integrable if and only if A is involutive. 

Recall that the Frobenius theorem states that a vector subbundle of TM is (algebraically) involutive if and 
only if it is the tangent bundle of a foliation on M. The same is true for distributions: A smooth distribution 
is involutive if and only if it coincides with the set of vectors tangent to a generalized foliation. To give 
content to this statement and elaborate on it, we need to quickly review the concept and main properties of 
generalized foliations. 

A generalized foliation on M is a partition $ := {£j a }aeA of M into disjoint connected sets, called 
leaves, such that each point m G M has a generalized foliated chart (U,ip : U — > V G R dlmM ), m G U. 
This means that there is some natural number p a < dimM, called the dimension of the leaf £ Q , and a 
subset S a G R dimM -f= such that tp(U n L a ) = {(x 1 , . . . , x dimM ) G V | (x p "+\ . . . , z dimM ) G S a }. The 
key difference with the concept of foliation is that the number p a can change from leaf to leaf. Note 
that each (xo° +1 , ■ ■ ■ , a; dlmM ) G S a determines a connected component (U D £> a )o of U H £ Q , that is, 
ip((U PI £ a )o) = {(x 1 ,. . . , x Pa , Xo Q+1 , ■ ■ ■ , ^ dlm M ) G V}. The generalized foliated charts induce on each leaf 
a smooth manifold structure that makes them into initial submanifolds of M. 

Recall that a subset N c M is an initial submanifold of M if N carries a manifold structure such that the 
inclusion i : N <— > M is a smooth immersion and satisfies the following condition: for any smooth manifold 
P an arbitrary map g : P — > N is smooth if and only if i o g : P — > M is smooth. The notion of initial 
submanifold lies strictly between those of injectively immersed and embedded submanifolds. 

A leaf L a is called regular if it has an open neighborhood that intersects only leaves whose dimension 
equals dimiL Q . If such a neighborhood does not exist, then L a is called a singular leaf. A point is called 
regular (singular) if it is contained in a regular (singular) leaf. The set of vectors tangent to the leaves of $ 
is defined by 

T(M,d) := |J |J T m L a c TM. 
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Under mild topological conditions on M a generalized foliation has very useful properties. Assume that 
M is second countable. Then for each p Q -dimensional leaf Z a and any generalized foliated chart (U, if : 
U — > V G R dlmM ) that intersects it, the corresponding set S a is countable. The set of regular points is 
open and dense in M. Finally any closed leaf is embedded in M. Note that this last property is specific 
to (generalized) foliations since an injectively immersed submanifold whose range is closed is not necessarily 
embedded. 

Let us return now to the relationship between distributions and generalized foliations. As already men- 
tioned, given an involutive (and hence a completely integrable) distribution A, each point to G M belongs 
to exactly one connected integral manifold £ m that is maximal relative to inclusion. It turns out that 
£ m is an initial submanifold and that it is also the accessible set of to, that is, L m equals the subset of 
points in M that can be reached by applying to m a finite number of composition of flows of elements of 
r(A). The collection of all maximal integral submanifolds of A forms a generalized foliation 3A such that 
A = T(M, 3a)- Conversely, given a generalized foliation # on M, the subset T(M,3 r ) C TM is a smooth 
completely integrable (and hence involutive) distribution whose collection of maximal integral submanifolds 
coincides with These two statements expand the Stefan- Sussmann Theorem cited above. 

In the study of Dirac manifolds we will also need the concept of codistribution. A generalized codistribution 
S on M is a subset of the cotangent bundle T*M such that S(m) := SflT^M is a vector subspace of T^M. 
The notions of rank, differentiate section, and smooth codistribution are completely analogous to those for 
distributions. 

If A C TM is a smooth distribution on M, its (smooth) annihilator A° is defined by 

A°(m) := {a(m) \ a G Q}{M), (a, X) = for all X G X(U), to G U open, 
such that X(u) G A(u) for all u G U}. 

We have the, in general strict, inclusion A C A 00 . A similar definition holds for smooth codistributions. Note 
that the annihilators are smooth by construction. If a distribution (codistribution) is a vector subbundle of 
TM (respectively of T*M), then its annihilator is also a vector subbundle of T*M (respectively of TM). If 
A is a subbundle then A = A°° and similarly for codistributions. 

2.3 Characteristic equations 

A Dirac structure defines two smooth distributions Go, Gi C TM and two smooth codistributions P , Pi C 
T*M: 

Go(to) := {X(m) G T rn M | X G X(M), (X, 0) G T(D)} 

Gi(m) := {X(m) G T m M | X G X(M), there is an a G ^ 1 (M),such that (A, a) G T(D)} 

and 

P ( m ) : = {a(m) G T* n M \ a G f2 1 (M), (0, a) G T(D)} 

Px(to) := {a(m) G T r * n M \ a G ^(M), there is an X G X(M), such that (A, a) G r(£>)}. 

The smoothness of Go, Gi, Po, Pi is obvious since, by definition, they are generated by smooth local sections. 
In general, these are not vector subbundles of TM and T*M, respectively. It is also clear that Go C Gi and 
Po G Pi. The distributions Go, Gi are related to the codistributions P and Pi through the operation of 
taking annihilators. 

The characteristic equations of a Dirac structure are 

(i) G = P?, P = GJ. 

(ii) Pi c Gg, Gi G Pg. 

(iii) If Pi has constant rank, then Pi = Gg. If Gi has constant rank, then Gi = Pg. 
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The following facts are useful in the study of Dirac structures. 

A. Let P be a constant rank codistribution on M and b : P° — > (P )* a skew-symmetric vector bundle 
map (in every fiber). Then D C TM © T*M defined for every m £ M by 

D(m) := {(X(m),a(m)) £ T m M®T^M \ X a smooth local section of P°,a £ fi^M), a| P o = X b } (3) 

is a Dirac structure on M. 

Conversely, if D is a Dirac structure on M having the property that Gi C TM is a constant rank 
distribution on M, then there exists a skew-symmetric vector bundle map b : Gi — > G| such that D is given 
by © with P := P = GJ. Also, ker(b : Gi -> G?) = G . 

B. Let G be a constant rank distribution on M and jj : G° — > (G )* a skew-symmetric vector bundle map 
(in every fiber). Then D C TM ®T*M defined for every m £ M by 

D{m) := {(X(m),a(m)) £ T m M ®T^M \ a a smooth local section of G°,X £ £(M),A| G o = a 11 } (4) 

is a Dirac structure on M. 

Conversely, if D is a Dirac structure on M having the property that Pi C T*M is a constant rank 
codistribution on M, then there exists a skew-symmetric vector bundle map jj : Pi — ► PJ such that D is 
given by © with G := G = P°. Also, ker(jj : Pi -> P*) = P . 

If D is a closed Dirac structure on M then Go and Gi are algebraically involutive distributions. Hence, if Gi 
is in addition a subbundle of TM, it is integrable in the sense of Frobenius. Analogously, if the codistribution 
Pi has constant rank, i.e., Pi C T*M is a subbundle, then Go = Pi C TM is an involutive subbundle and 
thus integrable. 

A function / £ C°°(M) is called admissible if d/ £ T(Pi). If the Dirac structure D on M is closed, there 
is an induced Poisson bracket {■, - }d on the admissible functions given by 

{f,g} D =X g [f] = -X f [g}, (5) 

where Xf £ X(M) is such that (Xf, df) £ T(D). Note that Xf £ X(M) is not uniquely determined by this 
condition. If the Dirac structure is not closed, we get with the same definition an almost Poisson structure, 
that is, the Jacobi-identity doesn't necessarily hold. 



2.4 Integrable Dirac structures as Lie algebroids 

The statement of integrability of Gi in the preceding subsection can be extended to closed Dirac structures 
without the assumption of constant dimensionality of the fibers of Gi. To formulate this well-known result 
in detail, we need a short presentation of Lie algebroids. 

A Lie algebroid E — > M is a smooth vector bundle over M with a vector bundle homomorphism p : E — > 
TM, called the anchor, and a Lie algebra bracket [■, ■] : T(E) x T(E) — > T(E) satisfying: 

1 . p is a Lie algebra homomorphism 

2. for all / £ C°°(M) and X,Y £ T(E): 

[XJY]=f{X,Y]+p(X)[f]Y. 



It is shown in ICourand (|l990bh that for an arbitrary Lie algebroid E —> M, the smooth distribution p(E) is 



completely integrable. 

Assume that D is a closed Dirac structure. Then, relative to the Courant bracket ([2]) and the anchor 
7Ti : D TM given by the projection on the first factor, D becomes a Lie algebroid over M. The smooth 
distribution 7Ti(D) C TM coincides with Gi. Indeed, v m £ tti(D) if and only if there is some a m £ T*M 
such that (v m ,a m ) £ D(m). However, D is a vector bundle and hence admits local sections. Therefore, 
the point m £ M has an open neighborhood U 3 m and there are X £ X(U) and a £ il 1 (C/) s uch that 
v m = X (m) and a(m) = a m , which is equivalent to v m £ Gi(m). Furthermore, Theorem 2.3.6 in lCourantl 
(1990b) states the following result. 



G 



Theorem 2.1 An integrable Dirac structure has a generalized foliation by presymplectic leaves. 

The presymplectic form com on a leaf N of the generalized foliation is given by 

Lu N (X,Y)(p) = a(Y)(p) = -p(X)ip) (6) 

for all p G N and 1,7 e X(N), where in ■ N <—* M is the inclusion and X, Y G T(Gi) are jjy-related to 
X,Y, respectively; we shall denote Ziv-relatedness by X ~i N X and Y ~i N Y. The 1-forms a, j3 G S1 1 (M) 
are such that (X, a), (Y, (3) G T(D). Formula (J6j) is independent of all the choices involved. Note that there 
is an induced Dirac structure on N given by the graph of the bundle map b : TN — > T*N associated to uim 
(see SjU]). 



2.5 Implicit Hamiltonian systems 

Let D be a Dirac structure on M and H G C°°(M). The implicit Hamiltonian system (M,D,H) is defined 
as the set of G°° solutions x(t) satisfying the condition 



(x,dH(x(t))) G D(x(t)), for all t. 



(7) 



In this general situation, conservation of energy is still valid: H(t) = (dH(x(t)), x(t)) — 0, for all t for which 
the solution exists. In addition, these equations contain algebraic constraints, namely, dH(x(t)) G Pi(x(t)), 
for all t. Note that x(t) G G±(x(t)), so the set of admissible flows have velocities in the distribution Gi. 
Thus, an implicit Hamiltonian system defines a set of differential and algebraic equations. 

Note that if Gi is an involutive subbundle of TM 1 then there are dimM — rankGi independent conserved 
quantities for the Hamiltonian system ([7]). We want to emphasize that standard existence and uniqueness 
theorems do not apply to {7J, even if all the distributions and codistributions are subbundles. The only gen- 
eral theorems that ensure the local exis tence and uniquenes s of solution s for J7]) are for the so-called implicit 
Hamiltonian systems of index one (see Blankensteinl ( 2000h . Blankenstein and van der 



2.6 Restriction of Dirac structures 

First, we describe the restriction of Dirac structures to submanifolds. Let D be a Dirac structure on M 
and N C M a submanifold of M. Define the map cr(m) : T m N x T^M -> T m N x T^N, to G iV, by 
a(m)(v m , a m ) = (v m , a m \T m N)- Assume that the dimension of Gi(to) n T. m N is independent of to G N and 
that the rank of Gi is constant on M. Define the vector subbundle Dm C TN T*N by 

D N {m) = ct(to) (D(m) n (T m N x T^M)) , to G N. 

Then D jy is a Lagrangian subbundle in the Pontryagin bundle TN @T*N and is thus a Dirac structure on 
N. 

Let i : AT > M denote the inclusion map and define for all to G Af 

S a (m) := {(JC(m),a(m)) G T m Af x T^M | a G f2 1 (M),X G X[M) such that 
X(n) G T n N for all n G N for which AT is defined} 

(where the subscript s stands fo r subm anifold) . This defines a smooth bundle E s = U TO gjv^ s (m) on A". 
Blank enstein and van der Schafti (|200ll ) show that under the assumption that the fibers of E s R D have 



constant dimension on M, there is another way to give the induced Dirac structure, namely, {X, a) is a 
local section of Dm if and only if there exists a local section (X, a) of D such that X ~ t X and a — c*a. 
Otherwise stated, 

T(D N ) = {(X,a) G X(N) © Q 1 (N) \ there is (X, a) G T(D) such that X ~ 4 AT and 5 = /,*«}. (8) 

Furthermore, if I? is closed, then Dm is also closed. As stated in blankenstein and van der Schafti (|20mh . if 
Gi is constant dimensional, the assumptions for both methods of restriction are equivalent. 

Second, we recall the restriction construction for implicit Hamiltonian systems. Given is the implicit 
Hamiltonian system (M, D, H) and JVcMan invariant submanifold under the integral curves of (M, D, H) 



7 



(if they exist). Define Hn := H\n = H o l. Then every solution x(t) of (M, D, H) which leaves N invariant 
(that is, x(t) C N) is a solution of (N, Dn, Hn)- The converse statement is not true, in general. 

For example, assume that N C M is such that every X G Gi is tangent to N, that is, X(n) E T n N, for all 
n G N. Then the solutions of (M, £>, if) contained in N are exactly the solutions of the implicit generalized 
Hamiltonian system (TV, Dn, Hn)- 

Another interesting example of the restriction construction appears under the following hypotheses. As- 
sume that D is closed and Gi is a vector subbundle of TM. Recall that there exists a skew-symmetric vector 
bundle map b : Gi — > G* with kernel Go, such that 

D{m) = {(v m ,a m ) G T m M x T* m M \ a m - \>{v m ) G Gi(m)°, « G Gi(m)}, m G M. 

Since in this case Gi is algebraically involutive and constant dimensional, it is integrable in the sense of 
Frobenius. Hence Gi defines a foliation partitioning M into integral submanifolds of Gi. 
Restricting D to such an integral submanifold N yields 

D N {m) = {(v m ,Om) E T m N x T^N \ a m = b(v m ), for all m G N}, 

where b is the restriction of b to N. Then b defines a closed 2-form on N with kernel Go- Hence Dn is a 
presymplectic structure on N. This leads to a special case of Theorem 12.11 In particular, the restriction 
(N, Dn, Hn) is a presymplectic Hamiltonian system on TV. 

3 Reduction of Dirac structures 

In this section we introduce Lie group and Lie algebra symmetries of a Dirac manifold. Then we present two 
of the three symmetry reduction methods of Dirac structures found in the literature and show that they are 
equivalent. 

3.1 Lie group and Lie algebra symmetries 

Let G be a Lie group and $ : G x M — > M a smooth left action. Then G is called a symmetry Lie group of 
D if for every g G G the condition {X,a) G T(D) implies that ($* X, $*a) G T(D). We say then that the 
Lie group G acts canonically or by Dirac actions on M . 

For any admissible / G C°°(M), i.e., a function such that (Xf,df) G T(D) for some Xf G X(M), this 
yields ($*X /; $*d/) G T(D) or ($* g X f , d($*/)) G T(D). Hence we have simultaneously the facts that $*/ 
is admissible and that &* g Xf — A$* f =: Y € L(Go). This implies for the almost Poisson bracket on admissible 
functions (see 12. 

%{f,h} D = -%(X f [h]) = -($* g X f W g h] = -(Y + X nf W g h] 

= -d($* g h)(Y + x nf ) = -d($ g h)(x nf ) = {$;/, & g h} D 

since is an admissible function (and hence d(Q*h) G Pi G Gq and Y G r(Go)). 

The Lie group G is a symmetry Lie group of the implicit Hamiltonian system (M, D, H) if, in addition, H 
is G-invariant, that is, H o $ 9 = H for all g G G. 

Let g be a Lie algebra and £ G g i— > £m £ 3£(M) be a smooth left Lie algebra action, that is, the map 
(x, £) G M xj h ^mC^) S TM is smooth and £ G g i— » £m G 3C(M) is a Lie algebra anti-homomorphism. 
The Lie algebra q is said to be a symmetry Lie algebra of D if for every £ G the condition (X, a) G L(D) 
implies that (j£f M X, £^ M a) G I\D). If, in addition, £^ M H = for all £ G g, then g is a symmetry Lie 
algebra of the implicit Hamiltonian system {M, D, H). Of course, if q is the Lie algebra of G and £ i— > £m is 
the associated infinitesimal generator, then if G is a symmetry Lie group of D it follows that q is a symmetry 
Lie algebra of D. 

3.2 The reduction methods 

There are three reduction procedures of Dirac structures in the literature. Two of them are standard 
and appear in various works, the third one is still in the stage of development and is considerably more 
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general ( Cendra et al. I (l2008h . We shall review the two established procedures here and show that they are 



equivalent. 

In all that follows we shall assume that G is a symmetry Lie group of the Dirac structure D on M and 
that the action is free and proper. Thus, the projection on the quotient 7r : M — > M/G := M defines a left 
principal G-bundlc. Note that the Dirac structure D C TM © T*M is G-invariant as a subbundle since for 
all g G G and (X, a) £ T(D) we have (&*X,Q*a) G r(Z3). Recall that the infinitesimal generators £m for 
£ G g are also G-equi variant: for all £ G q we have 

^M = (Ad„-i^MGSM (9) 



(see, e.g., Marsden and Ratiul (1999), Lemma 9.3.1), where qm ■— {vm \ V G £)} C X(Af). Define for m G M 



the vector subspace V(m) := {£m( to ) I £ G g} C T m M and the distribution V := U me jvfV(m). Since the 
G-action is free, V is a vector subbundle of TM. The subbundle V is G-invariant (see ©). It is worth 
noting that the space of sections F(V) coincides with the G°° (M)-module spanned by Qm- The identity 
<!>*V = V° for all g G G follows immediately. Note also that V(m) is the tangent space at to G M of the 
G-orbit through m, where the orbit is endowed with the manifold structure that makes it diffeomorphic to 
G, using the freeness of the action. 

For all to G M the map T m ir : T m M — ► T ff ( m )M is surjective with kernel V(m). This yields an isomorphism 
between T m M/V(m) and T w ( m )M. The Lie group G acts smoothly on the quotient vector bundle TM/V by 

g ■ v :— T& g (v), where v G TM/V; this action is well defined by ©. 

In what follows we shall need the following elementary observation: each section of TM/V is the projection 
of a smooth vector field on M. Indeed, pick a G-invariant Riemannian met ric on M (wh ose existence is 
gu aranteed by the paracompactn ess of M and the properness of the G-action; Palais! ( 196ll ). Theorem 4.3.1 



Duistermaat and Kolk ( 2000h . Proposition 2.5.2), decompose TM = V © V- 1 , and identify the vector 



or _ 

bundles TM/V and V- 1 . Thus sections of TM/V are identified with smooth vector fields on M taking values 
only in V . 

For X G X(M), we will say that the section X :— X(modV) of TM/V is G-equivariant, if there is a 
representative X G of X that is G-equivariant, i.e., a smooth section X G G X(M) G with X — X G G T(V). 
This is equivalent to the condition [X, V) G r(V) for all representatives X of X and for all V G T(V). This is 
the content of Corollary IA. 31 In what follows we shall use these two equivalent definitions interchangeably. 

The representative X G of X uniquely induces a smooth vector field X on M, where X is defined by the 
condition X G ~ w X, that is, Tir o X G = X o ir (see also Proposition IA.2[) . Then, for any representative Y of 
I = F we have Y -X G =:V E T(V), and hence Tir o F = Ttt o X g + Ttt o V = X o ir, which shows that 

n: T{TM/V) G -> 3C(M) , , 

X(modV) i ^ X, ^ ^ 

where X is defined by the condition X G X, that is, Ttt o X g = X o 7T, is a well defined homomorphism 
of G°°(M)-modules (note that G°°(M) ~ C°°(M) G via / ^ tt*/). This map (TO]) is in fact bijective, 
hence an isomorphism. To prove injectivity, let X and Y be G-equivariant elements of T(TM/V) with 
n(X) = n(y) = X. Then we necessarily have X G — Y G G T (V) and thus X = = _ y . The proof 
of surjectivity uses the Tube Theorem (see, e.g., Palais! ( 196l[ ) or lOrtega and Rathi (2004) Theorem 2.3.28) 



which states in the case of free proper actions that for every point to G M one can find a G-invariant open 
neighborhood U of to and a G-equivariant diffcomorphism ip : U — > G x B, where B is an open ball in the 
vector space T m M/T m (G ■ to); the G-action on G x B is left translation on the first factor. Thus ip induces a 
diffcomorphism ip : U/G — > B uniquely determined by the condition tpow\u — P2 o ip t where P2 : G x B — > i? 
is the projection on the second factor. Now if X G X(M), then 1^*X G X(_B) so that X G X(G x _B) defined 
by X(g, b) :— (0, (-0*X)(&)), for 5 G G and 6 G B, is G-equivariant. Therefore, ?/>*X G X(£7) is G-equivariant 
and we clearly have ip*X ~ w X by construction which show that the map X(modV) 1— > X is surjective. 
(The con struction o f G-eq uivariant lifted vector fields on M from vector f ields on M/G is done for compact 



(Lhe con struction o r Cr-eq uivanant luted vector fields on M irom vector nelds on MJU is done lor com] 
groups in lBierstond ( 19751 ). Theorem D, and for general proper actions in Duistermaat! . Theorem 6.10.) 



In the same way, for all a G Q 1 (M), we have ir*a G r(V°) G . Note that if a G r(V°) G , then the 1-form 
a G fi 1 (M) defined by (a(ir(m)),T m Tr(v m )) :— (a(m),v m ), for all v m G T m M, is well defined and satisfies 
ir*a = a. This shows that the map a G 1 (M) 1— > 7r*a G r(V°) G is an isomorphism of G°°(M)-modules. 
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We close these preliminary remarks by recording that the G-action on (TM/V) © V° 
is free and proper. 

A. Dirac reduc t ion a s a particular instance of Courant algebroid reduction. This method is introduced in 



Bursztvn et al.l (|2007l ) who describe a general procedure of reduction of Courant algebroids. This is then 
used to reduce Dirac structures in a Courant algebroid. A Courant algebroid over a manifold M is a vector 
bundle E — > M with a fiberwise nondegenerate symmetric bilinear form (-,-), a R-bilinear bracket [-, ■] on the 
smooth sections T(E) (not necessarily skew-symmetric), and a bundle map p : E — > TM called the anchor, 
which satisfy the following conditions for all e\, e 2 , e 3 £ T(E) and / £ G°°(M): 

1- [ei, [e 2 ,e 3 ]] = [[ei,e 2 ],e 3 ] + [e 2 , [ei,e 3 ]], 

2. p([e 1 ,e 2 \) = [p(e 1 ),p(e 2 )], 

3. [ei,fe 2 ] = /[ei,e 2 ] + {p{e 1 )[f])e 2 , 

4. p(ei)(e 2 ,e 3 ) = ([ei,e 2 ],e 3 ) + (e 2 , [e 1 ,e 3 ]), 

5. ([ei,ei],-} = |p*d(ei,ei) 

(see Bursztvn et al. ( 2007t )). In the standard case of interest to us in this paper, this reduction procedure is 



very simple and can be described as follows. 

The Courant algebroid E is the Pontryagin bundle TM(BT*M with the Courant bracket (see the discussion 
following ©): 

[(A, a), (Y, 0)] = ([A, Y], £ x - i Y da) (11) 
for all sections (A, a) an d (Y, 0) of TM ®T*M . 



We apply the results of lBursztvn et all (j2007j ) to the vector subbundle % := V © {0} C TM © T*M of the 
Pontryagin bundle and its orthogonal complement 3C 1 - — TM © V° . Both vector subbundles are G-invariant 
and it is easy to show (in agreement with the more general results of lBursztvn et al. that 



is a Courant algebroid over M with the symmetric bilinear 2-form that descends from the one on IX^/UC 
given by 

((A, a), (Y, 0)) x ± /x = /3(A) + a(Y) (13) 

for all a, in T(V°) and A, Y in X(M); here A := A(modV), Y := F(modV) denote local sections of 
TM/V induced by local vector fields on M. 

We have used above the following general fact that will be needed also in later arguments. 

Lemma 3.1 Let 7r : E — > M be a smooth vector bundle over M . Assume that there are two free proper 
G '-actions on E and M, respectively, such that tt is equivariant. Then the induced map ttq ■ E/G — > M/G 
defined by the commutative diagram 

E — ► M 

E/G wa i M/G 
is also a smooth vector bundle whose rank is equal to the rank of E. 

Proof: It is straightforward to check that the map ttq is a smooth surjective submersion and that its fibers 
are vector spaces. To prove local triviality, choose m € M and an open neighborhood U of 7Tm(w) such that 
there exists a diffeomorphism 

*: *jt(U) - UxG 

n i ^ (vr M (ri), *2("-)) 
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with W(m) = (7Tj\/(m), e). Now, since E is a vector bundle over M, there exists an open set U with 
to G J7 C 7r^ (17), a diffeomorphism 

9: tt-^U") -> Uxn-^m) 
v n i * (n,8 2 (t) n )) 

(where u n is an element of 7r _1 (n)), and an open set U' with m € U' Q U such that : 17' — > itm(U') x 
^2(^0 is a diffeomorphism and hence ^(f') an open neighborhood of e in G. Define 

A: ^{-KMiU')) -» 7r M (C/') x^'^mM) 

TTfiK) l-> (tmW, (7T£ O B 2 O $f 2( „)-l)K)) , 

where $ E : G x E — > i? denotes the G-action on E. Since, n M (U') is open in A7/G with 7t(to) e ttm(U'), 
this is a smooth local trivialization for the vector bundle ttq : E/G — > Af/G around the point 7r(m). 
The rank of E/G is computed to be 

rank(£/G) = dim(E/G) - dim(A7/G) = dim E - dim G - dim M + dim G 

= dim — dim M = rank E. □ 

In fact, with the identifications given above of r(V°) G with O x (M) and r(TA7/V) G with X(M), it is 
obvious that the G-equivariant sections of (fl"2|) are in one-to-one correspondence with those of TM © T*M. 
Note that this says that we have a vector bundle isomorphism 

nr± I 

— G ~ TM®T*M (14) 

Jv / 

over A7 = M / G. This vector bundle isomorphism preserves the symmetric pairing; indeed, for all to £ M 
and (X, a), (Y,fi) G L(TM © T*M) the bracket ( , ) A? satisfies 

((X, a), (F , ^))m(tt(to)) = a(y)(7r(m)) + /3(A > )(tt(to)) = (**£*) (y)(m) + (tt*/3)(X)(to) 
= ({X, n*a), (y tt*/3))(to) = ((X, 7r*a), (?, Z^))^/^™) 

where X and y are G-equivariant local vector fields such that X ~ w X, Y ~ w Y and X = X(modV), 
Y = y(modV). Since all chosen objects are G-equivariant and the vector bundle isomorphism (|14p is 
equivalent to the one defined on the corresponding spaces of local sections, this relation proves the statement. 



We shall prove below that the Courant bracket on TM © T*A7 also descends from the Courant bracket 
on TM © T*M in the following sense. Recall that if (X, a) and (Y, 0) are sections of TA7 © T*M, then the 
truncated Courant bracket on sections of TM © T*M is given by (see (TTT10 

[(X,a),(Yj)] = ([X,Y],£xP-iyda). 

Let X,Y £ X_(M) and a,/3 G O, 1 (M) be such that L(TA7/V) G and a,J3 G L(V°) G . Thus,_these define 

uniquely X,y G X(M), a,/3 G fi 1 (M) by the conditions X G ~ w X, y G ~ w V and ?r*a = a, n*f3 = (3, where 
X G , Y G G X G (A7) are such that X - X G =: V, Y - Y G =: W G L(V). Since X G and Y G are G-equivariant, 
the last three terms in 

[X,Y] = [X G + V,Y G + W] = [X G ,Y G ] + [V,Y G ] + [X G ,W] + [V,W], (15) 

are sections of V. Hence 

[X,Y]:=[X^YG]^\xy) (16) 

defines a Lie bracket on the G-invariant sections of T(TM/V) (where r(TAf/V) G is considered with its 
G°°(A7)-module structure). Since [X G ,Y G ] ^ [X,Y], this shows that the image of [-X\y] under the 
isomorphism of sections given in (flT)]) is exactly [X,y], the first component of [(X, a), (Y,f3)]. 
Since ir*a, ir*(3 G L(V°) G , we have for all V, W G L(V) 

£x G +v( n * 0) ~ iyG +w d(7r*a) = £ x g(tt*/3) — iyGd(7r*a) = tt* {£x/3 — iyda) . (17) 
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Thus, since for all V, W G T{V) 



[(X G + V, w*a), (Y G + W, tt*/3)] = ([X G ,Y G ] + [V, Y G ] + [X G , W] + [V, W],tt* (£jcP ~ W*)) , 
we conclude that 

is exactly the G-equivariant section of X ± /X corresponding to [(X, a), (Y,/3)]. This discussion proves the 
following. 

Proposition 3.2 The Courant bracket on TM T*M induces a well-defined bracket on the G-invariant 
sections ofX ± /X characterized by the property that if X,Y G T(TM/V) G and a, ft € r(V°) G correspond to 
I,Fe X(M) anda,/3 G f2 (M), respectively, then the bracket [(X, a), (Y,0)] corresponds to [(X,a), (Y,f3)]. 
This bracket on T(X ± /X) G , also called Courant bracket, corresponds by the quotient map in (TTi|) to the 
Courant bracket on TM ®T*M. 



Now assuming that DnX x has constant rank, that is, DCdQ x is a smooth vector subbundle of TM@T*M, it 
follows that (DnX^) 1 - = D+X and DCflC are vector subbundles of TM®T*M. The second conclusion follows 
from the fact that for all m G M we have dim(D(m) + X(m)) — dim D(m) + dimDC(m) — dim(D(m) DX(m)) 
which shows that DdX has constant dimensional fibers and is hence a vector subbundle. Form the pointwisc 
quotient 

(D n X x ) + JC _ (D n {TM V°)) + (V {0}) 

X ~ V0{O} 1 ' 

with base M. At each point m G M, one gets a subspace of the vector space (T m M/V(m)) © V°(m) ~ 
X x (m)/X(m) (see (p^])). 

Proposition 3.3 Relative to the symmetric nondegenerate bilinear form (|13[) on X x /X, the vector subspace 

s (D(m)nX(m) x )+X(m) , J^m) 1 - 
Dim) := — — or — — — r- 

v ; X(m) J X(m) 

satisfies D(m) = D(m) x . 

Proof: Let us prove that Dim) C D(m) x . Let (X(m),a(m)) G D(m). If (X, a) G T(Z)) are local sections 
about m, then a G T(V°) and there are X G X(M) and V G F(V) such that (X + V, a) G T(D) and 
X = X(modV). For all (Y,/3) G T(D) we have analogously local vector fields Y G X(M) and W G T(V) 
such that (Y + W,/3) G r(£>) and F = F(modV). This yields 

{{X,a),(Y,0)) x ± /x ® ((X + F,a),(F + W,/3)) = 0, 

since (X + V,a), (Y + W,0) G r(D). 

To prove the inclusion, D(m) x C D(m) let (X(m),a!(m)) G Dim) 1 - be such that (X, a) G r(3C J -/^) and 
for all (y,/3) G r(£>) we have ((X,a), (Y,f3)) x ± /X = 0. Choose X G 3C(M) such that X = X(modV). For 
all (Y,(3) G T(D n DC- 1 ), (9,(3) lies in r(£>) and we get 

= ((X, a), (Y, (3)) x ± /x = ((X, a), (Y, /?)) = a(Y) + f3(X). 

This yields {X, a) G T((D n K^). We have (£), n JC^) 1 - = D x + (X x ) x = D q + X q for every g in the 
domain of definition of (X, a). Thus, since D and X are smooth vector bundles, there exists X' G X(M) 
and W G T(V) such that (X' , a) G T(D) and X = X' + W. Now recall that the 1-form a is in fact in 
r(V°) since (X, a) was an element of T(X X /X). The pair (X', a) is consequently in r(D n X ) and, since 
X = (X' + VF)(mod V) = X'(mod V), our (X, a) is a local section of D, as required. □ 

This proposition immediately implies that dim Dim) is constant on M and equal to 

dimX x (m) - dimDCfm) dim M + (dim M - dim G) — dim G ,. , r ,. ^ 
— ; — 1 = = dim AY — dimG. 
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Thus I) is a smooth G-invariant subbundle of DC -1 /DC. Its image by the isomorphism (TT4")) gives a subbundlc 
D red = D/G of 

— / G ~ TMffi T*M, 
DC / 

whose rank is (dimM — dimG), which is isotropic relative to the symmetric pairing on TM © T*M. Hence 
-D rc d is a Dirac structure called the reduction of D by G. This discussion and Proposition 13.31 yield the 
following consequence. 

Proposition 3.4 The sections of -D ro d are in one-to-one correspondence with the G-equivariant sections of 
the quotient (fTS)) via the isomorphisms X(M) ~ r(TM/V) G and f2 x (M) ~ r(V°) G given of £fte beginning of 
this subsection. 

It is customary to denote the "quotient" Dirac structure on M/G by 

(jnDC^) + DC / 

Proposition 3.5 If the Dirac structure D is closed then the reduced Dirac structure D re d is also closed. 

PROOF: The proof is based on the fact that the Courant bracket on the G-invariant sections of DC /DC 
descends to the Courant bracket on TM © T*M. Indeed, if (X,a), (Y,j3) £ r(D rc d), consider the corre- 
sponding G-invariant sections (X, ir*a), (Y, tt*(3) of ((D n DC^) + DC) /DC. Let now X G , Y G £ X(M) G be such 
that X = X G + V, Y = Y G + W, where V, W £ T(V). Since (X G + V, n*a) and (Y G + W, tt*/3) are sections 
of (D n DC^) + DC and D is closed, we get as in ([17]). 

[(X G + V,Ti*a),(Y G + W,tt*P)} = ([X G + V,Y G + W},tt* (£xP-iya)) £ T(D n DC^ + DC). 

Thus, from (fT5| and (Tl~6|) we deduce that 

However, by Proposition 13.21 f[X,Y],ir*(£xP — iy-da)^ descends precisely to the Courant bracket 

[(X,a),(Yj)] = ([X,Y],£ x P-iyda). 
Therefore, [(X, a), (Y,0)] £ r(D re d) which proves that D re d is closed. □ 



B. Dirac reduction as an extension of Poisson reductio n. T his was historically the first me t hod t o re- 
duce Dirac structures and it is due to Blankenstei n ( 2000) and Blankenstein and van der SchaftJ ( 200 if ) (see 
Blankenstein and Ratiul (|2004 ) for the singular case). 
Define for all m £ M the vector subspace 

E(m) = {(X(m),a(m)) £ T m M x T* m M \ X £ X(M),a = n*a for some a £ f2 x (M)}. 

Then E := U m6 M-E(w) = TM © V° = DC^ is a vector bundle and thus the assumption 

D HE is a vector subbundle of TM © T*M 

of Blankenstei n and van der Schaftl (|200lh is identical to the assumption 

D n DC 1 is a vector subbundle of TM © T*M 



of iBursztvn et al.l (|2007T ). which is in turn equival ent, as we have seen before, to the h ypothesis that 
the fibers of D n DC are constant dimensional. In Blankenstein and van der Schaft (2001), there is the 
additional assumption that V + G n is c onstant dime nsional on M. Their proof is based on results in 
Niimeiier and van der Schaft (1990) and llsidoril (1995). They also need V to be an involutive subbundlc 
of TM , which holds in our case since the action o f G on M is free and proper. The cited result of llsidori 



1 19951 ) (and of Niimeiier and van der Schaft (1990) with a stronger hypothesis) is exactly the statement of 
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Proposition IA.1I applied to the invol utive subbundle V of T M and the generalized distribution Go- Our 
proof of this proposition, inspired bv lCheng and Tarnl <|l989l ). needs only that Go is a locally finite smooth 



distribution and that V is an involutive vector subbundle of TM. 



To s ummarize, the hypothesis needed for the two methods of reduction are not equivalent; in Burszty n et al 



'2001) one needs only that D n "K is a subbundle ofTM © T*M and in Wlankenstein and van der Schafi 
2001 ) one needs the additional a ssumption that Go is a locally fini t e smo oth distribution. 



The reduced Dirac structure of iBlankenstein and van der Schaftl (|2001r ) is given by 

T(D) = {(X,a) G T(TM © T*M) | there isle X(Af) such that X ~ w X and (X,n*a) G r(Z>)}. (19) 

Proposition 3.6 The sections of D are exactly those of _D rc d and the vector bundles D and -D re d a^e 
identical. 

Proof: Choose (X,a) G T(D). By (JTSJ), there exists X G X(M) such that X ~ w X and (X,Tr*a) G T(D). 
Since there exists X G G X(M) G such that X G ^ X, we have with X — X G that X is a G-equivariant 
section of TM/V. The 1-form a := vr*a lies in r(V°) G and thus we have (X, a) 6 r(£> n 3C- 1 ) and 



(.AT, a) G T 



Thus this section corresponds to a unique section (X rc( j, a re( j) of -D re d- Now this section is given by X ^ n 
^red, which yields X = X Te d. The equality 7r*a re d = a = n*a implies that a re d — <5 because it is a surjective 
submersion. □ 

The description of D re d shows that the smooth distribution Go/V projects to G ed and that the smooth 
codistribution n 2 (D n (V © V°)) projects to P^ cd , where tt 2 is the projection tt 2 : TM ®T*M -> T*Af. There 
is no analogous description as quotients of the distribution G5 cd , and P™" 1 ; they need to be computed from 
the definition on a case by case basis. 

Depending on the example, one needs to choose which method of Dirac reduction is easier to implement. 
In the next section, we will present cases where we have global bases of sections for the Dirac structure and 
in that situation the first method is more convenient. 



The t hi rd me thod of reduction alluded to at the beginning of this subsection is due to lYoshimura and Marsden 



(|2006allbLl2007h . It is undergoing a major extension t o encompass both the Lagrangian and Hamiltonian 



version of classical reduction (see Cendra et al. ( 2008f )). Since this work is still in progress we shall not 
comment on it here. 



4 Reduction of nonholonomic systems 

4.1 Summary of the nonholonomic reduction method 

Bates and Sniatvckil (|l993h propose a reduction method for constrained Hamiltonian systems. They start 



with the configuration space Q, a hyperregular Lagrangian L : TQ — > M taken as the kinetic energy of a 
Riemannian metric, and a constraint distribution D on Q equal to the kernel of smooth 1-forms <p l , . . . , (j> k G 
f2 1 (Q) satisfying pointwise 1 A . . . A <f) k ^ 0, that is, 

D := {v e TQ | <jP(v) = 0, j = l,...,fc}. 

The independence of the forms (which is equivalent to the hypothesis (ft 1 A . . . A 4> k ^ at every point of Q) 
ensures that D is a smooth vector subbundle of TQ. 

Denote by (• , •} : T*Q x TQ — > M the duality pairing between 1-forms and tangent vectors. Let FL : 
TQ -> T*Q, 

L(v + tw), v,w£T q Q, 

t=o 



(¥L(v),w) := - 



be the Legendre transformation associated to L which is a diffeomorphism since the Lagrangian is hyper- 
regular. If A(v) := (¥L(v),v) denotes the action of L, let H(p) := A((¥L)- 1 (p)) - L((FL)" 1 (p)), p G T*Q, 
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be the associated Hamiltonian. The Hamiltonian vector field X determined by H and the constraint forms 
(p 1 , . . . , (j) k G is defined classically by 

. dH . dH . l4 

1 = ~~5 — ' P ~ "5 1" A J'^ 

op 05 

or 

i x cj = dH + \ J n* T , Q 4P (20) 

where 7Tt*q : T*Q — > Q is the cotangent bundle projection and Ai,...Afe G C°°(Q) are the Lagrange 
multipliers associated to the constraint forms , . . . , (f> k , and the constraint equations 

cfPiWL- 1 (p)) = ^(Tttt-qX) = 0. (21) 

The counterpart of the constraint distribution D in phase space is the constraint manifold 

M := FL(D) = {p G T*Q | <^>t*q(p)) ((FZ)" 1 ^) = 0, j = 1, . . . , fe} C T*Q. (22) 

Since we require that the solution be in the constraint submanifold M, it follows that X is tangent to M. 

Set wm := i*w can , where i : M ^ T*Q is the inclusion and w can is the canonical symplectic form on T*Q. 
Define 

5F : = {U G TT*Q | ir^ Q ^{U) = 0, j = 1, . . . , fc} (23) 

and note that tt^q^ 1 A . . . A TT^, Q (j) k ^ on T*Q. Therefore 3" — > T*Q is a vector subbundle of TT*Q. The 
nonholonomic horizontal distribution is defined by 

M:= Jfl TAf — > M. (24) 



Bates and Sniatvcki (Il993h prove that the restriction uj{ of to Jf x "K is nondegenerate. Their proof 



uses the fact that the Lagrangian is the kinetic energy of a metric plus a potential. They also show that !K 
is a vector subbundle of TM. With the condition (|2ip on X, we get for j = 1, . . . , k, 

ir* T « Q ^(X) = ( fS(Tir T , Q X) = Q 

and thus the vector field A" is a section of !K. Hence it is easy to see that the pull back to M of (|2U)l subject 
to the constraints (f2"Tj) is equivalent to X £ T(!K) and ix^M = d^|w- 

Assume that G is a Lie group acting symplectically on T*Q (not necessarily the lift of an action on Q), 
leaves M invariant, and preserves the Hamiltonian H. Assume that the quotient M = M/G is a smooth 
manifold with projection map 7T : M — > M a submersion. Since G is a symmetry group of the nonholonomic 
system, all intrinsically defined vector fields and distributions push down to M. 

In particular, the vector field X on M pushes down to a vector field X with X ^ X and the distribution 
IK pushes down to a distribution 5C re d on M. However, lo^c need not push down to a 2-form defined on J{ re d 
on M, despite the fact that to^c is G-invariant. This is because there may be infinitesimal symmetries £m 
which are horizontal (that is, take values in !K), but i^ M u)^c 7^ 0. Let V be the distribution on M tangent to 
the orbits of G, that is, its fibers are V(m) := {£m(w) £ G 9} for all meMC T*Q. Define the horizontal 
annihilator U of V by 

u = (v n , KY M nKc tm c tt*q, (25) 

where the superscript u>m on a distribution denotes its fiberwise WA/-orthogonal complement in TM. Clearly, 
It and V are both G-invariant, pr oject down to M, and the image of V is {0}. Define !K := Ttt (U) C TM 
to be the projection of 11 to M. Bates and Sniatvcki ( 1993f ) show that X takes values in It and that the 



restriction u>u of uim tollxll pushes down to a nondegenerate form lo^- on 3€, i.e., tt*ujj{ = ujy_- In addition, 
the function H G G°°(Af) defined by 7r*if = H\m and the induced vector field X on M are related by 

ix^ft = dH\M (26) 

which can be interpreted as the definition of the reduced nonholonomic Hamiltonian vector field X. 

Remark 4.1 Note that we have no information about the dimensions of the fibers of U. In general, U is 
not a vector subbundle of TM. A 
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4.2 Link with Dirac reduction 

Let M, lom, 7Tt*q, ?f, M, and 7r : M — > M be as in the preceding subsection. An easy verification shows 
that 

5C = (TCttt-qIm))- 1 ^) C TM C TT*Q, (27) 

where 

D := {« G TQ | (^,t>) = 0, j = l,...,k} C TQ 

is the constraint distribution on Q. 

We introduce the Dirac structure D on M as in Yoshimura and Marsden ( 2006bl ): for all m G M define 



D(m) = {(X(m),a m ) e TM 8 T*M \ A G r(5C), a - \ x u M G r(IK )} (28) 

and let D := U meM D(m) C TM. 

The Lie group G acts on M and leaves Ji, u>m> an d thus the Dirac structure D invariant. Define % := 
V 8 {0} C TM 8 T*M and its orthogonal complement 3C 1 - = TM 8 V° as in SgOl Assume, as in SgOl that 
D (~l 3C 1 - is a vector subbundle of TM 8 T*M and consider the reduced Dirac manifold (M, -D rc d)- The next 
proposition shows that, if J£ is constant dimensional, the reduced Dirac structure is given by the formula 

D rcd = {(A, a) G r(TM 8 T* M) \ X G r(5C), = 

where !H and cj^ are defined as in the preceding subsection. 

Proposition 4.2 (i) The generalized distribution Go is trivial and the codistribution P\ is given by Pi = 
T*M. 

(ii) Let U = % n (V n 'K) UM {see (25])). Then 

X G r(U) there exists a G T(V°) sucft tfcai (A, a) G r(D n DC X ). (29) 

Wi£/i £/ie additional assumption that V + J{ = TM, i/ie section a in (|29[) is unique. 

(iii) T/ie reduced distributions G\ od and Gg cd are given by 

G[ cd = 3C and G r cd = {0}. 

(iv) Tor eac/i a G r(V°) i/iere exists exactly one section X G r(U) suc/i </iai (A, a) G r(D). Hence, we 
have iT2(D n DC ) = V° and i/ie reduced codistribution P™ d is eg-aa/ to T*(M/G). 

(v) Assume that G™ d — §i is constant dimensional. The 2- form defined on G™ d — 'K by the Dirac structure 
-Dred (see ([5])) is nondegenerate and is equal to uj^. 

Proof: (i) If A is a section of Go, we have ix^M G r(!H ) and A G r(!K). Hence, since oj-k is nondegenerate, 
the vector field A has to be the zero section. Thus Go = {0}. 

Since the 2-form lu^c is nondegenerate an arbitrary a G f2 1 (M) determines a unique section A of 'K by 
the equation ixu-h = Oi\j{. Therefore, Pi = T*M. 

(ii) If (A, a) is a local section of D fl 3C-, then we have A G r(5£), a £ r(V°), and a = ix^M on JC. 
Hence, (ix^Af) :xnv = and thus we have 

a g r(M n (v n J£) WM ) = r(U). 

Conversely, if X G r(U), we have ix^M = on V fl % and we can find a section a G r(V°) such that the 
restriction of a and ix^M to IK are equal. 

If, in addition, we make the usual assumption V + H = TM, we have for each A G r(U) exactly one 
a 6 fi 1 (Af) such that a\-H — ix^M and a\y = 0. 

(iii) By construction, the constraint distribution Gi cd associated to the Dirac structure -D re d on M is given 

by 



16 



This can obviously be identified with 

% = Tk{U). 

If we have X g r(G^ od ), then (X,0) g r(D red ) and there exists X g X(M) with X ~ w X and (X,0) g 
L(£>). Hence we have X 6 r(G ) and since G = {0}, we get X = 0. This shows that Gf, od = {0}. 

(iv) This follows directly from (i) and (ii). 

(v) Let uj Drcd be the 2-form defined on Gf d _= ft by the Dirac structure D lcd (see ©). If X g r(IK) 
is such that W£) red (X, Y) = for all Y g r(J£), then (X, 0) is a section of D rc d and hence we have by 
(iii) X = 0. Thus wr> rod is nondegenerate on "K. Let X and Y be sections of IK. We show now that 
u Dted (X,Y) = Indeed, by definition, we have cj Drcd (X,Y) = a(Y), where a,0 G 9^{M/G) 
are "such that (X,a),(Y,/3) g r(L> rod ). Choose X,Y g r(U) with X ~„. X, Y ~„. Y and (X,7r*a), 
(y 7r*/3) g F(L> n ft- 1 ). Then we have 

u Dred (X,Y) = a{Y) = (7r*a)(Y)=u J u(X,Y)=^(X,Y), 

where the last equality follows simply from the definition of ui^. □ 

We shall use part (ii) of this proposition to simplify certain computations in the examples that follow. 

Remark 4.3 Note that if "X + V has constant rank on M, we have automatically that D D ft 1 " has constant 
dimensional fibers on M. 

Since JC, V, 3i + V are vector subbundles of TM, JCflV is also a subbundle of TM. By the nondegeneracy 
of uj^c, we get that U = {% (~l V)" M n = (!K n V) W;K has also constant dimensional fibers on M and is 
in particular a vector subbundle of J£. Let it be the dimension of the fibers of U, r the dimension of the 
fibers of "K. Then, if n = dimM, n — r is the rank of the codistribution !K . Let finally I be the rank of 
the codistribution Jf n V° = (V + Jf)° C Choose local basis vector fields Hi, . . . ,H r for "K such that 
Hi, ... , H u are basis vector fields for IX. In the same way, choose basis 1-forms j3i,..,, f3 n - r for Jf° such 
that (3i,...,Pi are basis 1-forms for V° n "K . Then a local basis of sections of D is 

{{Hi,i Hl u M ), (H r , i Hr u M ), (0, Pi), . . . , (0, /?„_,.)} ■ 

The considerations above show that D n ft 1 " is then spanned by the sections 

{/ n—r \ / n—r \ ^ 

H u i Hl uj M + J2 >■■■,( H u , i Hu u M + <Pi > (0, Pi), ■ ■ ■ , (0, Pi) > , 

V 1=2+1 / V »=;+i / J 

where are smooth functions chosen such that ih^m + Y^i=i+\ a jPi e r(V°) for j = 1, . . . , u. Since these 
sections are linearly independent, they are smooth local basis sections for D n ft . A 

4.3 Example: the constrained particle in space 



Bates and Sniatvckil (|1993I ) study the motion of the constrained particle in space. The configuration space 
of this problem is Q := ]R 3 whose coordinates are denoted by q := (x, y, z). They take the following concrete 
constraints on the velocities: 

D := ker(dz - ydx) = {v x d x + v y d y + v z d z \ v z - yv x = 0} C TQ. 

The Lagrangian is hyperregular and taken to be the kinetic energy of the Euclidean metric, that is, £(q, v) := 
i||v|| 2 . Hence the constraint manifold is five dimensional and given by 

M := {(x,y,z,p x ,p y ,p z ) | p z = yp x } C T*Q, 

where (x,y, z,p x ,p y ,p z ) are the coordinates of T*Q. The global coordinates on M are thus (x,y, z,p x ,p y ). 
The pull back lom of the canonical 2-form lu on T*Q to M has hence the expression 

uj m = dx A dp x + dy A dp y + dz A (p x dy + ydp x ). 

The Dirac structure D modeling this problem is given by (f2"5| . Formula (|2T|) gives the vector subbundle 

■K:= {T^t-qMT 1 ^) =s V axi{d x + yd z ,d y ,d Poi ,d Pv } C TM, 
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and consequently 

J-C° = span{dz — ydx}. 

A computation yields 

h. x + v d,u M = (1 + y 2 )dpx + VPxdy 
id y UM = dp y -Pxdz 
id Py u M = -dy 
' l d p ^M = -ydz - dx. 

Hence 

{ (d x + yd z , (1 + y 2 )dp x + yp x dy) ; (d y ,dp y - p x dz) ; (d Py ,-dy) ; (d Px ,-ydz - da;) ; (0,dz - ydx) } (30) 

is a smooth global basis for D. 

We consider the action of the Lie group G = R 2 on M given by 

$ : G x M -> M, $((r,s),m) = (x + r, y, z + s,p x ,p y ), 

where m := (x,y,z,p x ,p y ) £ M . This R 2 -action is the restriction to M of the cotangent lift of the action <f) : 
GxQ — > Q, s), (x, y, zj) = (x+r, y, z+s). It obviously leaves the Hamiltonian H(m) = ^{(^+y 2 )p x +P y ) 
on M invariant. Note that if (X, a) 6 r(D) we have 

(£ 6M X,£ 6M a)eT(D) for all ^ = 1 2 . 

Since the vertical bundle in this example is V = spanjc^, d z }, we have 

X = V © {0} = span{(9 2; , 0), (0„ 0)} C TM © T*M 

and thus 

X 1 - = TM © V° 

= span{(d x , 0), (fy, 0), (0„ 0), , 0), (0 p> , 0), (0, dy), (0, dp s ), (0, d Py )} (31) 
A direct computation using (|3"0|) and (|3"Tj) yields 

D n 3C 1 - = span { (d Py , -dy) , (d x + yd z , (1 + y 2 )d Px + y Px dy) , ((1 + y 2 )d y - «p x a p . , (1 + y 2 )dp y ) } 
and 

(D n 3C X ) + DC = span { (9 Pj/ , -dy) , (d x , 0) , (9 2 , 0) , (0, (1 + y 2 )d Pa; + y^dy) , 

((l+y 2 )d y -y^,(l+y 2 )d P!/ )} 

since in this case {D n 3C 1 -) n 3C = {0}. 

Note that there is an easier way to compute the spanning sections of D n X 1 - by using (J2U). First, one 
determines spanning sections of 11. Second, for each spanning section X 6 r(U) we find A 6 C°°(M) such 
that 

i x u M + A(dz - ydx) eT{V°). 

Third, setting a := ix^Af + A(dz — ydx) we have found a spanning section (X, a) S T(Z) n DC -1 ). In the 
following examples, we will proceed like this. 
We get the reduced Dirac structure 



(DnX^+X 

T> T ed — — 



'r-ar.™! { d py~ d y) > (0,(l + y 2 )d^ + yp x dy) , \ 
P I ((i + y 2 R - y Px d Px , (l + y 2 )d Pa ) J 



X 

on the three dimensional manifold M :— M/G with global coordinates (y,p y ,p x ). 



18 



Since d x + yd z is a spanning section of "K fl V, the distribution U C TM (see (|2"5)l ) is given by 

It = (V n l K) u}M n "K = ker{i aaj+ ^w M } n JC 
= ker{(l + y 2 )dp x + y^dy} n M 
= span { (1 + y 2 )d y - yp^ , d x + yd z ,d Py } . 

Thus 

JC = T7r(U) = span{9 Py , (1 + y 2 )d y - y Px d p J 

recovering the result in Bates and Sniatvckil ( 1993h . Note that, as discussed in §4.2| the d istribution "X C TM 
coinci des with the projection on the first factor of the reduced Dirac structure (|32|) . As in Bates and Sniatvckil 
(|l993f ). % is an integrable subbundlc of TM; in fact [d Py , (1 + y 2 )d y — yp x 9 Px ] = 0. The 2-form uj^ is easily 
computed to equal 

^(d Py , (1 + y 2 )d y - VPxdpJ = -dy((l + y 2 )d y - yp x d p J = -(1 + y 2 ). 

As predicted by the general theory in §4.1| u>^ is nondegenerate. 

It is easy to check that the reduced manifold M is Poisson relative to the 2-tensor 



-d y A d Py + 



1 



y 



or with Poisson bracket determined by {y,p y } = —1, {y,p x } = 0, {p y ,p x } = yp x /(l + y 2 ), and that -D re d 
given by ([32]) is the graph of the vector bundle homomorphism b : T*M —> TM associated to the Poisson 
structure. 



4.4 Example: the vertical rolling disk 

T his example is standard in the theory of nonholonomic mechanical systems; it can be found for example 
in Bloch ( 20031 ). Consider a vertical disk of zero width rolling on the xy-plane and free to rotate about 



its vertical axis. Let x and y denote the position of contact of the disk in the xy-plane. The remaining 
variables are 9 and ip, denoting the orientation of a chosen material point P with respect to the vertical and 
the "heading angle" of the disk. Thus, the unconstrained configuration space for the vertical rolling disk is 
Q := R 2 x S 1 x S 1 . The Lagrangian for the problem is taken to be the kinetic energy 

L(x,y,9,(p,x,y,d,ip) = t;M(± 2 + y 2 ) + ^I9 2 + ^Jfi 2 , 

where /i is the mass of the disk, and /, J are its moments of inertia. Hence, the Hamiltonian of the system 

is 111 

H(x,y,9,ip,p X7 p y ,p g ,p (p ) = —(p 2 x +p 2 y ) + —p 2 e + — p% ■ 

The rolling constraints may be written as x = R9 cos tp and y = R9 sin ip, where R is the radius of the disk, 
that is, 

T> := {(x,y,9,(p,R6cosip,R9sm(p,6,(p) \ x, y e E, 9, tp e S 1 } C TQ. 

Note that the 1-forms defining this distribution D are <pi := dx — Rcosipd.9 and (j>2 ■= dy — Rs'm(pd9. 
The constraint manifold (|22l) 

{fiR fiR 1 

(x,y,9,tp,p x ,p y ,p e ,p v ) G T*Q p x = —pecosip, p y = —p e smtp I C T*Q 

is in this example a graph over the coordinates (x, y, 9, ip,pe,p v ) and is hence six dimensional. The induced 
2-form lum = i*u can is given by the formula 

/ uR cos ifi , llR sin ip , \ , / nR sin ip , uR cos ip , \ 
uj m =dx A f - ^ dp e - - p e d<p\ + dy A f f dp e + J ^ p e dtp\ 

+ d9 A dpe + dip A dp v 
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and the distribution J{ = kcrjda; — Rcos(pd8, dy — Rsmtpd9} C TM is in this case 

IK = span{<9 v , do + Rcosipd x + Rsimpdy, d pg ,d Pv } C TM. 
Therefore its annihilator is 

%° = span{dx - i?cos<pd6»,dy - RwupdB} C T*M. 

The Dirac structure on M describing the nonholonomic mechanical system is again given by 

^Rshup fiRcosip 
*d v u M = dp v H p e dx p e dy, 

uR cos ip , uR sin ip 
ia Pe w M = j dx dy - d9, 

i 9pip UM = -dtp, 



(32) 



Since 



and 



/ fiR cos ip fiRshup 

ld0+B.coB<pd v +Rsm<pd y VM = dpg + R COS ip I dpg j dip 

_ . / uR sin w , uR cos tp , 
+ .Rainy I - - dp e + - - y d(^ 

1 + I dp e 



we get again smooth global spanning sections of D: 



<9»,dp ¥ 



//i? sin ip 



idx 



/xi? COS w 

— - 1 



■)dy 



uR cos ip , uRsmtp , 
da; ay — du 

(dp ,—d(p) , (0, dx — i?cos ipdff) , (0, dy — RsimpdO) 



dg + Rcosipd x + Rsimpdy, ( 1 



UR 2 



In this case, several groups of symmetries are studied in the literature. 



dp t 



(33) 



1. The case G = R 2 (jCantriin et alj (|1998fl ). 
The Lie group R 2 acts on M by 

(r, s) ■ (x,y,6,ip,p e ,p v ) = (x + r, y + s, 6, ip,pe,P<p) 

and clearly leaves the Hamiltonian H invariant. The distribution V on M is in this case V 
spaniel, d y }, so that Vn!K = {0}by (|3"2"|) . Therefore, in this example, U = "K. We have 



X = V © {0} = span{(9 x ,0), (d y ,0)} C TM © T*M 



and 



X 1 - = TM ffi V° 

= span 0), (fy, 0), (de, 0), 0), (d P9 , 0), 0), 

(o,d Pv ),(o,dip),(o,dpg),(o,de)}. 
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By (p?5)) and the fact that V + J£ = TM, we know that for each spanning section X of %, there exists 
exactly one a € r(V°) such that the pair (X, a) is a section of 13 n X . Using ([33]) and the equalities 



/u,R sin 03 /ii? cos ip 
id^M pg{dx - Rcos(pd9) H (dy - Rsimpdu) = dp v (34) 



we find 



uR cos 09 

ia P8 ^M H j pe(dx - Rcosipd6) 

+ ^^( dy - RsinvdO) = - (l + 0£ ) d« (3.-,) 



D n K x = span | (d v , d Ptp ) , fa, , - (l + dfl) , (d Pv , -d<p) , 

/ ui? 2 \ 

<9e + Rcostpd x + Rsimpdy, I 1 H — I dp f 



Hence 



(D n 3C X ) + % = span j dp^) , fa , - (l + ^) dO^j , (d p<p , -dip) 



d e ,[l + ^)dp e ),(d x ,0),(d y ,0) 



and finally we get the reduced Dirac structure 



(DnX^+X I ( (dv,d Pv ,),(d pe ,-(l + *f)d6),(d Pv ,-dip) 

Dred = ~ / G = span { , I2 \ ' ) (36) 

x I \ (d e ,(l + ^-)dp e ^ ( 



on the four dimensional manifold M — M/G with coordinates (ip,9,p v ,pg). Thus, Z? r cd is the graph 
of the symplectic form on M given by cj rc d = dip A dp v + (1 + ^j-)dO A dpg. 

As already mentioned, in this example, U = J£ and hence 5i = T-k{"K) = spanjc^, d p , dg, d pg } by (|32|) 
which coincides with the projection on the first factor of the reduced Dirac structure (|36|) . In this case 
"K = TM and so to^ — cj rc d is of course nondegenerate. 

2. The case G = SE(2) (|Blochl (|2003h L 

The Lie group SE(2) := § 1 (S)R 2 is the semidirect product of the circle S 1 identified with matrices of 
the form 

cos a — sin a 
sin a cos a 

and acting on R 2 by usual matrix multiplication. Denote elements of SE(2) by (a, r, s) where 
and r,s£i Define the action of the Lie group SE(2) on M by 

(a, r, s) ■ (x,y,9,ip,pg,p v ) = (x cos a — y sin a + r, x sin a + y cos a + s,9,ip + a,pg,p v ) 

and note that the Hamiltonian H is invariant by this action. The distribution V on M is in this case 
V = span{9 x , dy,d v } and we get 

X = V &) {0} = span{(&, 0), (d y , 0), (d v , 0)}. 

Thus 

X 1 - = TM © V° = span {(8 X , 0), 0), (dg, 0), 0), (0 Ps , 0), (9^,0), 

(0,dp v ),(0,d w ),(0,d#)}. 
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We have V n "K = span{d v } (see (J22)) and hence 



, , i fJ-R sin ip , iiR cos tfi 
(V n JtT M = ker d Pip + f - — t-p e dx - P - - Pe dy 



so that 



IX = % (~l ker ( dp v H - Pedx - pedy 



/ii? sin tp /iR cos (p 
nodx 

span{<9 v , de + R cos <p + R sin y> <9 y , d Pe }. 



Using (03]), ([23]), and we get 



Dn3C x =span|(^,d^), (d pe ,- (l + d9 j . 

de + R cos ^9^; + i? sin t^dy , ( 1 H — I dpt 



Thus, 



(Dnx^ + x l„ \ (o,d P(p ),(d Pe ,-(i + ef)d6 

£>rcd = " ^ / SPai1 



ft, (1 + *f )d Pe ) 



is the graph of the Poisson tensor 

defined on the manifold M := M/G with coordinates (0,pg,p v ). 
In addition, 

"K — Tir(VL) — spanjft, <9 P(9 } 

is an integrable subbundle of TM (since [dg, d pe \ = 0). Note that the projection on the first factor of 
■Dred equals !H. Finally, the 2-form ui^ is easily computed to be 

^■k {de,d pg ) = 1 + ^— 

and, as predicted by the general theory, it is nondegenerate on 5£. 
3. The case G = 8 1 x M 2 (|Blochl (|2003h ). The direct product Lie group 8 1 x R 2 acts on M by 

(a,r, s) ■ (x,y,6,(p,p g , Pv ,) = (x + r,y + s,6 + a,ip,pe,p<p). 
The distribution V on M is in this case V = spanjc^, d y ,de}, 

X = V® {0} = span{(a x ,0), (0„,O), (ft,0)}, 

and thus 

3C 1 - = TM © V° 

= span {(ft, 0), (ft, 0), (ft, 0), (ft, 0), (ft 9! 0), (^,0), (0, dp,,), (0, dp 6 ), (0, dip)} . 
Using (j52")l we get Vfl JC = spanjft + R cos tpd x + Rs'mipdy} and hence 

(V n , KY M = ker ( f 1 + dp b 
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Therefore, again by (|3"2|) we conclude 



U = •K n (V n Jf) WflI = J{ n ker <j ( 1 + ^p- I dp 6 



= spanjc^ ,dg + R cos tpd x + R sin (/? 9j, , d Vv } . 
Using ([55]) and (j3~i|) . we obtain 

L> fl = span | (d^&py) , (<9 Pv5 , -dip) , (d g + Rcosipd x + Rsimpdy, ^1 + j 



and hence 



Aod = / G = spsai{(d {p ,dp v ) ,(d Pv ,-d(p) ,(0,dp e )} 



which is the graph of the Poisson tensor 



on the three dimensional reduced manifold M = M/G with coordinates {<p,p<p,pe)- We have 

"K = Tn(U) = span{d v , d Pv } 

which is an integrable subbundle of TM (since [d^,d p ] = 0). As before, the projection on the first 
factor of -D re d equals !H. The 2-form uj^ has the expression 

ujfc (d v ,d Pv ) = 1 

and, as the general theory states, it is nondegenerate on !H. 

4.5 Example: the Chaplygin skate 

The standard Chaplygin skate. This example can be found in lRosenbere (1977). It describes the motion 



of a hatchet on a hatchet planimeter, that behaves like a curved knife edge. It is now commonly known under 
the name of "Chaplygin skate". Let the contact point of the knife edge have the coordinates x,y S 1R 2 , let 
its direction relative to the positive cc-axis be 9, and let its center of mass be at distance s from the contact 
point. Denote the total mass of the knife edge by m. Thus the moment of inertia about an axis through the 
contact point normal to the xy plane is I = ms 2 . The configuration space of this problem is the semidirect 
product Q := SE(2) = S 1 ©^ 2 whose coordinates are denoted by q := (9,x,y). We have the following 
concrete constraints on the velocities: 

D := kcr(sin 8dx — cos 9dy) = span {cos 9d x + sin 9d y , dg} C TQ. 

The Lagrangian is hyperregular and taken to be the kinetic energy of the knife edge, namely, 

L(6, x, y, 9, x, y) = -m(x — s9 sin 9) 2 + ^m(y + s9 cos 9) 2 

= —m(x 2 + y 2 ) + —ms 2 9 2 + ms9(y cos0 — xs'm9), 

where we have used that the x and y components of the velocity of the center of mass are, respectively, 

x — s9 sin 9 and y + s9 cos 9. 

Compute 



dx 
dL 

dy 

dL 



p x = — — = mx — ms9 sm f 
ox 



p v — = my + ms9 cos ( 
dy 



= — r = ms 2 9 + ms(ycos9 — xsinfl). 
88 
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In Ts we have ycos9 — xsin9 — and hence we get for (9,x,y,pg,p x ,p y ) in the constraint submanifold 
M C T*Q: 

pg = ms 2 9 and p x sin 9 = mx sin 9 — ms9 sin 2 



my cos t 



ms9(l — cos 2 i 



= my cos 9 + ms9 cos — ms9 

= p y COs9 - -pg. 

Hence the constraint manifold M is five dimensional and given by 

M := {(9,x,y,pe,pa;,py) | pg = sp y cos 9- sp x sin9} C T*Q. 

The global coordinates on M are thus (9,x,y,p x ,p y ). The pull back ojm of the canonical 2-form oj on T*Q 
to M has hence the expression 

u>m = dx A dp x + dy A dp y + d9 A d(sp a cos 9 — sp x sin 0) 

= dx A dp x + dy A dp y + s cos 9d9 A dp y — s sin 9d9 A dp x . 

The Dirac structure D modeling this problem is given by (|28|) . Formula ([27]) gives the vector subbundle 

Oi := (T(7TT.Q|M))" 1 (2)) = span{cos^+sm^ !/ ,a e ,^,9 p JcTM, 

or equivalently 



A computation yields 



Jf = span{sin#dx — cos#dy}. 

icosea,+sinea B w M = cos9dp x + sin9dp y 

i-de^M = scos9dp y — ssm9dp x 
i dpy ojm = -dy - s cos 9d9 
ig ojm = dx + s sin 8d8. 



Hence 



{ (cos#<9 x + sm9d y ,cos9dp x + sm9dp y ) ; (dg, scos9dp y — ssin0dp x ) ; 
(d Py , — dy — s cos 9d9) ; (d Px , da; + s sin 9d9) ; (0, sin 9dx — cos 9dy) } 

is a smooth global basis for D. 

We consider the action of the Lie group G = SE(2) on Q, given by 

4> : G x Q — > Q, 0((a, r, s), (0, x, y)) = (0 + a, cosaa; — sin ay + r, sin ax + cos ay + s). 

Thus, the induced action on <f> : G x T*Q — > T*Q is given by 

r , s ), (0, x , V,Pe,p x ,Py)) 
= (9 + a, cos ax — sin ay + r, sin ax + cos ay + s,pg,cosap x — sin ap y , sin ap x + cosap y ). 

The action on Q obviously leaves the Lagrangian invariant. We show that the induced action on T*Q leaves 
the manifold M invariant: we denote with 9' , x' , y' ,p' x ,p y ,p'g the coordinates of $((a, r, s), (9, x, y,pg,p x ,p y )) 
and compute 

s cos 9 p y — ssm9'p' x —scos(9 + a)(sinap x + cosap y ) — ssin(# + a)(cosap x — sinap y ) 
=s(cos#cosa — sin # sin a) (sin ap^ + cosap y ) 

— s (sin 9 cos a + cos 9 sin a) (cos ap x — sin ap y ) 
=s cos 9p y — s sin 9p x = pg = p'g. 



24 



Since the vertical bundle in this example is V = sp&n{dg, d x , d y }, we have V n "K = sp&n{dg, cos 9d x + 
sin6*<9 y } and (V n JfJ^ 11 = kerjcos 9dp x + smddp y ,scos9dp y — ssin9dp x } = ker{dp x , dp y }. Hence the 
distribution U = (V n < K) UM n "K is given by span{<9#, cos 09 2 + sin 9d y } and 

£) n X J ~ = span {(cos 09a; + sin9d y ,cos9dp x + sin^dp^) , (dg, scos9dp y — s sin9dp x )} . 

We get the reduced Dirac structure 

(D n + 3C / 

D ro d = — / G = span { (0, cos 9dp x + sin 9dp y ) , (0, s cos 8dp y — s sin 9dp x ) } 

= span{(0,dp x ) , (0,dp y )} 

on the two dimensional manifold M :— M/G with global coordinates (p x ,p y ). Note that this is the graph of 
the trivial Poisson tensor on M. 

The Chaplygin skate with a rotor on it. We propose here a variation of the previous example by 
considering the Chaplygin skate with a disk attached to the center of mass of the skate that is free to rotate 
about the vertical axis. Again, let the contact point of the knife edge have the coordinates x, y G M 2 , let 
its direction relative to the positive x-axis be 9, and let its center of mass be at distance s from the contact 
point. Denote by m the mass of the knife edge. Thus its moment of inertia about an axis through the 
contact point normal to the xy plane is I = ms 2 . Let <f) be the angle between a fixed point on the disk and 
the positive x-axis and J be the moment of inertia of the disk about the vertical axis. The configuration 
space of this problem is Q := S 1 x S 1 x R 2 whose points are denoted by q := (</>, 6, x, y). We have again the 
following concrete constraints on the velocities: 

T> := ker(sin#d:r — cos6*dy) = span {cos8d x + sin6*<9 y , dg} C TQ. 

The Lagrangian is the kinetic energy of the knife edge: 

L(<j>, 9, x, y, 4>, 6, x, y) = ^m(x - s9 sin 9) 2 + ^m(y + s8 cos 9) 2 + ^J(9 + <j)) 2 

= ^m(x 2 + y 2 ) + i(7 + J)8 2 + ms9 (ycos9 - xsinfl) + ij^ 2 + J $6. 

Compute 

p x — rax — ms9 sin 9 
p y = my + ms9 cos 9 

pg = (I + J)9 + ms(y cos 9 — x sin 9) + J<f> 
P4> = J{4> + 8). 

Again, if we have ycos9 — x sin 9 — 0, we compute: 

pg = (I + J)9 + J4> = 19 + J(9 + <j>) = 18 + p = ms 2 8 + V4> 

and 

p x sin 8 = mx sin 8 — ms9 sin 2 9 = my cos 8 — ms9(l — cos 2 8) 

= my cos 8 + ms8 cos 2 8 — ms8 = p y cos 9 H — [p^ — pg). 

Hence the constraint manifold M is seven dimensional and given by 

M := {{(t>,6,x,y,p,j„pe,Px,Pv) I Pe = sp y cos8 - sp x sm9 + P4,} C T*Q. 

The global coordinates on M are thus (0, 8, x, y.p^.Px.Py). The pull back % of the canonical 2-form ui on 
T*Q to M has hence the expression 

u>m = dx A dp x + dy A dp y + d9 A d(sp y cos 9 — sp x sin 9 + p^) + d(f> A dp^ 

= dx A dp x + dy A dp y + s cos 9d9 A dp y — s sin 9d9 A dp x + (d9 + d(f)) A dp^. 
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The Dirac structure D modeling this problem is given by (f2"5)) . Formula (f2"T)) gives the vector subbundlc 

J£ := {T\v T * Q \ M )Y x {V) = spa t n{d <t> ,dg,cos9d x +sm9d y ,d p ^d Px ,d Py } C TM, 
or equivalently 



A computation yields 



%° = spanjsin 6 Ax — cos6*dy}. 
ia^M = dp^ 

i-d e wm = s cos #dj3j, — s sin 9dp x + Ap^ 
icoBe^+sin edy^M = cos9dp x + sm 9 Ap y 
ia^^M = -d0 - d0 
ia PH ^M = — dy — scos9d9 
[q lom = Ax + s sin6>d6>. 



We get 



D = span { (d<{,, dp^) ;(dg,s cos9dp y — ssin^dpz + dp,/,) ; (cosfldz + sin 9d y , cos 9dp x + sin 9 Ap y ) ; 

(d P4i ,—d9 — d(/>) ; (d Py , —dy — s cos 9d9} ; (d Px , —dx + s sin 9A9) ; (0, sin 9dx — cos 9dy) } . 

We consider the action of the Lie group G = S 1 x SE(2) on Q, given by 

4> '■ G x Q — > Q, cf>((f3,a,r,s), {4>, 9,x,y)) = (4> + P,9 + a, cos ax — sin ay + r, sin ax + cos ay + s). 

Thus, the induced action $ : G x T*Q -> T*Q on T*Q is given by 

$((/?, a, r,s), ((j>,9,x,y,p 4> ,pg,p x ,py)) 

— {4> + (3,9 + a, cos ra — sin ay + r, sin as + cos ay + s,pe, cos ap x — sinap y , sina^ + cos ap y ). 

The Lagrangian is invariant under the lift to TQ of 4> an d it is easy to see, with the considerations in the 
previous example, that the induced action <!> on T*Q leaves the manifold M invariant. 

Since the vertical bundle in this example is V = span{c^, dg, d x , d y }, we have VnJi = span{c?0, dg,cos 9d x + 
sin9d y } and (VnJC) WM = kerjdp^, cos 9dp x + sin 9dp y , s cos 9dp y — s sin9Ap x + dp^} = kerjdp^, dp x , dp y }. 
Hence the distribution U = (V n %) UM H !K is given by span{c>0, dg, cos 9d x + sin 9d y } and 

D n X 1 - — span {(c^, dp^) , (cos 9d x + sin 9d y , cos 9dp x + sin 9dp y ) ,(dg,s cos 9dp y — s sin 9dp x + dp^)} . 

We get the reduced Dirac structure 

D ™ d x l G 

= span { (0, dpc/,) , (0, cos 9dp x + sin 9dp y ) , (0, s cos 9dp y — s sin 9dp x + dp,p) } 
= span{(O,dp ) , {0,dp x ) , {0,dp y )} 

on the three dimensional manifold M := M/G with global coordinates (p^,p x ,p v )- This is again the graph 
of the trivial Poisson tensor on M. 

In these six examples we get integrable Dirac structures after reduction. We shall come back to this 
remark in the last section of the paper. 

5 The optimal momentum map for closed Dirac manifolds 
5.1 Definition of the optimal momentum map 

Let (M, D) be a closed Dirac manifold, G a symmetry Lie group of D acting freely and properly on M . 
Assume in the following that D^X 1 - is a vector bundle, where X = V©{0} C TM®T*M and X^ = TM x V° 
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(see ij3.2p . To define the optimal momentum map (as in Ortega and Ratiu ( 2004 )) we need to introduce an 
additional smooth distribution. Define 

D G (m) := {X{m) | there is a G T(V°) C Q 1 (M) such that (X, a) G T(D)} C Gi(m) 

for all m 6 M. Then = UrngM^GH * s a smooth distribution on M. 

If the manifold M is Poisson and the Dirac structure is the graph of the Poisson map ft : T*M — » TAf , 
then Ugh) = \Xf{y) J there is / G G°°(Af) G such that X/ = ft(d/)}, which recovers the definition in 



Ortega and Ratiul (|2004l) . 



Returning to the general case of Dirac manifolds, note that 

Dq = tti(D n (TM x V )) = tt\{D n DC -1 ) (37) 

and that we always have 

Go C D G C Gi. 

The following lemma will be helpful to show the integrability of the distribution D G . 

Lemma 5.1 Let (X,a),(Y,f3) G r(L>n3C ± ), i.e., X,Y G r(D G ). T/iera tfie l-/orm .£ x /3 - iyda is a local 
section ofV°. 

Proof: It suffices to show that (£ x f3 — Wda)(S,M) = for all (eg. Since D is G-invariant, we have 
(£ iM X,£ iM a) G r(L>) for all (eg. Since (Y,(3) G r(L>), we conclude f3{£ iu X) + [£ tM a){Y) = or 
f3(£ (M X) = -(£ (M a)(Y). Thus we get 

(£ x [3 - i Y da)(£ M ) = £x(/?((m)) - /3(«m) - da(Y,£ M ) 

= £ x (0) + f3(£ iM X) - Y[a(Z M )] + fa[a(Y)] + a(£ Y ( M ) 
= -(£ 6u a)(Y) - £ Y (0) + £ iM (a(Y)) - a(£ iM Y) = 0, 

where we used /?(£m) = «((m) = since a, (3 G r(V°). □ 

Lemma 5.2 // D is integrable, the space of local sections of the intersection of vector bundles D D 3C 1 - is 
closed under the Courant bracket. Hence, under the assumption that D n %^ has constant dimensional fibers, 
this vector bundle inherits a Lie algebroid structure relative to the truncated Courant bracket on T(D n 
and the anchor map n\ : DDOC 1 ^ — > TM. Thus, the distribution Dq = tti(D (~l DC^) is integrable in the sense 
of Stefan- Sussmann. 

Proof: Since D is integrable, the space of its local sections is closed under the Courant bracket, and hence 
for all {X, a), (Y, /?) G T(D n X. 1 -) we have (see ©) 

[(X, a), (Y, (3)} = {[X, Y], £ x (3 - i Y da) G T(D). (38) 

LemmaOimplies that £ x f3-i Y da G I\V°). Thus, [(X, a), (Y, (3)] G TpnSC^) and hence [X,Y] G r(D G ). 
The remaining statements follow immediately. □ 

Thus, if D (~i 3C 1 - is a vector bundle, M admits a generalize d foliation by the l eaves of the generalized 
distribution Dq. The optimal momentum is now defined like in lOrtega and Ratiul (|2004f ). 

Definition 5.3 Assume that D n is a vector subbundle of TM @ T*M . The projection 

3 : M -> M/D G (39) 

on ifte leaf space of Dq is called the (Dirac) optimal momentum map. 

In order to formulate in the next section the reduction theorem with this optimal momentum map, we 
need an induced action of G on the leaf space of Dq. This doesn't follow, as usual, from the G-equivariance 
of the vector fields spanning Dq because, in this case, they are not necessarily G-equi variant. 

Lemma 5.4 If m and m' are in the same leaf of Dq then <& g (m) and $ 9 (m') are in the same leaf of Dq 
for all g G G. Hence there is a well defined action $ : G x M/D G — > M /Dq given by 

**(3(m)) := 0(* a (m)) (40) 
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Proof: Let to and to' be in the same leaf of T)q. Without loss of generality we can assume that there 
exists X e T{Dg) with flow F x such that (m) = to' for some t (in reality, to and to' can be joined by 
finitely many such curves). Since (X, a) G T(D) for some a G F(V°) and D is G-invariant, it follows that 
G r(L>) for all .g G G. Because a G r(V°), for all £ € we have 

<(^a)(p),^(p)) = (a($ fl (p)),(Ad fl -iOw(* fl (p))>=0 
which shows that $*a G r(V°). Hence, $*X G r(D G ) for all g E G. For all s G [0,i] we have 

^ o ) (m) - T F x {m) % {X(F*(m))) 

= ($;-J)ft(ffH) G D G (<l> fl (F*(m))- 

Thus the curve c(s) = (<& g o F*) (m) connecting c(0) = <& 9 (m) to c(i) = $ 9 (F { x (m)) = $ 9 (m') has all its 
tangent vectors in the distribution 2) G and hence it lies entirely in the leaf of Dq through the point $ g (m).D 

Denote by G p the isotropy subgroup of p G M/Dq for this induced action. If g 6 G p and m G 3^ 1 (p), 
then 

0(* fl (m)) = * fl (3(m)) = * g (p) = P = d(m) 

and we get the usual fact that G p leaves invariant. Thus we get an induced action of G p on 

which is free if the original G-action on M is free. 

Also, 3~ 1 (p) is an initial submanifold of M since i t is a leaf of the g e nerali zed foliation defined by the 



integrable distribution T)q. By Proposition 3.4.4 in I Ortega and Ratiul (|2004h . there is a unique smooth 
structure on G p with respect to which this subgroup is an initial Lie subgroup of G with Lie algebra 

9p = {£ 6 fl I &r(m) G T m r X {p), for all m G d^ip)}- 
In general, G p is not closed in G. 

5.2 The universality of the optimal momentum map 

Definition 5.5 Let (M, D) be a Dirac manifold with integrable Dirac structure D and G a Lie group acting 
canonically on it. Let P be a set and J : M — > P a map. We say that J has the Noether property for 
the G-action on (M, D) if the flow F t of any implicit Hamiltonian vector field associated to any G-invariant 
admissible function h G G°°(M) preserves the fibers of 3, that is, 

J o F t = J|Dom(F t ) 

where Dom(i 7 ' f ) is the domain of definition of F t . 

Like in the Poisson case (see lOrtega and Ratiul ( 2004h ). one gets the following universality property. 



Theorem 5.6 Let G be a symmetry Lie group of the Dirac manifold (M,D) and 3 : M — > P a function 
with the Noether property. Then there exists a unique map <j) : M/Dq ~* P such that the following diagram 
commutes: 

M >P 





If 3 is smooth and G-equivariant with respect to some G-action on P, then <fi is also smooth and G- 
equivariant. 



Proof: The proof is the same as for Poisson manifolds (see lOrtega and Ratiul (|2004|)). Define <f> : M/Dq — > 
P by 4>(p) '■= J(w); where p = 3(m). The map 4> is well defined since if m! G d^ 1 (p) then there is a finite 
composition £Fy of flows associated to sections of Dg such that m! = 3V (to). Since J is a Noether momentum 
map we have 

J(m') = J(9r(m)) = J(m) = (t>{p). 
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The definition immediately implies that the diagram commutes. Uniqueness of <f> follows from the requirement 
that the diagram commutes and the surjectivity of 3- Equivariance of is a direct consequence of the 
definition (|4"0)) of the G-action on M/T>g- Finally, if all objects are smooth manifolds and 3, J a re smooth 
maps then <j> is a smooth map as the quotient of the smooth map J by the projection 3 (see iBourbaki 
l|l967h L □ 



6 Optimal reduction for closed Dirac manifolds 



In thi s section we generalize the optimal reduction procedure from Poisson manifolds fsee lOrtega and Ratiu 



(2004)) ;o closed Dirac manifolds. As we shall see, with appropriately extended definitions this important 



desingularization method works also for Dirac manifolds. 



6.1 The reduction theorem 

Theorem 6.1 (Optimal point reduction by Dirac actions) Let (M,D) be an integrable Dirac mani- 
fold and G a Lie group acting freely and properly on M and leaving the Dirac structure invariant. Assume 
that D (~l 0C 1 - is constant dimensional and let 3 '■ M — > M/Dq be the optimal (Dirac) momentum map as- 
sociated to this action. Then, for any p G M/Dq whose isotropy subgroup G p acts properly on 3~ 1 (p), the 
orbit space M p = 3~ l {p)/G p is a smooth presymplectic regular quotient manifold with presymplectic form 
Lu p G Q, 2 (M p ) defined by 

(w* p uj p ) (m)(X(m), Y(m)) = a m (Y(m)) = -/3 m (X(m)) (41) 

for any to G 3 (p) and any X, Y G T(T>q) defined on an open set around m, where a, (3 G r(V°) are such 
that {X,a), (Y,j3) G T(D n T^), and ir p : 3 _1 (/°) -> M p is the projection. The pair (Alp, D p ) is called the 
(Dirac optimal) point reduced space of (M, D) at p, where D p is the graph of the presymplectic form lo p . 



Note that if D is the graph of a Poisson structure on M, the distribution Go is {0}, all functions in G°° (M) 
are admissible, and we a re in the setting of the Optimal point reduction by Poisson actions Theorem (see 



Ortega and Ratiul (|2004h . Theorem 9.1.1) 



Proof: Denote by $ p : G p x 3^ 1 (p) — ► 3~ x (p) the restriction of the original G-action on M to the Lie 
subgroup G p and the manifold 3~ 1 {p)- Since, by hypothesis, the G-action on M is free and the G p -action 
on 3~ 1 (p) is proper, the quotient 3~ 1 (p)/G p is a regular quotient manifold and hence the projection ir p : 
3~ 1 {p) - ► d~ 1 {p)/G p is a smooth surjective submersion. We show that lo p given by (|4Tj) is well-defined. Let 
to,™' G 3~ 1 (p) be such that 7r p (m) = w p (m'), and let v, W G T m 3~ 1 (p), v',w' G T m '3~ 1 {p) be such that 
T m % p {v) = T m ,K p (v'), T m np(w) = T m i-K p {w'). Let (X,a),(X',a'), {Y,(3), (Y',(3') be sections of D n % ± 
such that X(rn) = T m i p (v), X'(m') = T m n p (v'), Y(m) = T rn i p (w), Y'(m') = T m n p (w'). The condition 
7t p (to) = 7t p (to') implies the existence of an element k G G p such that to' = $^(m). We have then 7r p = 
7r p o and thus T m ir p = T m /7r p o T m <& p k . Furthermore, because of the equalities T m ir p (v) = T m iir p (v'), 
T m Ti p (w) = T m ,TT p (w'), we have 

T m ,TT p (T m <f> p k (v) - v') = and T m ,ir p {T m <S> p {w) -w')=0 

and there exist elements £ 2 G fl p and sections (Pi, 771), (^,772) G T(D n 0C ± ), such that 

X'(m') - T m <Z> k (X(m)) = &(m') = V x {m') and Y'{m!) - T m <f> k (Y{m)) = &(m') = V 2 (m'). 



This yields 

X'{rri) = ($t-iX) (rri) + V x {rri) and F'(m') = (3>l-iY) (to') + V 2 {m'). 
Because (X', a') and ($^_ i y, $t_i/3) are sections of D PI 3C 1 - in a neighborhood of the point to', we have 

($*_ l/ 3)(X') = -a'($*_ 1 y), (42) 
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and thus we conclude 

u p [-K p {m')){T m ,T, p {v'),T m ,-K p {w')) = [Tr;u p )(m')(X'(m'),Y'(m')) 

= a 1 '{m'){Y> \m')) = a'(m') (($* k -iY) (mf) + V 2 (m')) 

= a'(m')(($UY) (to')) 

^-(®UP)(m')(X'(m')) 

= -(^_x/3)(m') ((n-iX) K) + ^i(m')) 

= -(^.x^K) ((*;-xX) (m')) 

= -/?(rn)(X(m)) 

= w p (7r p (m))(T m 7r p (w),r m 7r (9 (w)). 

Finally, we show that ui p is closed. Let m £ 3~ 1 (p), v,w,u G T m J _1 (p). Again, choose sections (X, a), 
(Y,/3), (Z,7) e r(Z) n 9C X ) such that u = X(m), w = Y(m), and u = Z(m). Now write 

(d(7r; Wp )) m («, u) = (d(^> p )) m (X(m), y(m), Z(m)) 

and compute, recalling the definition pTj) and formula (|38[) . 

d(^ p )(x, y,z) = x [{*; Up yy, zj\ - y [{^(x, z)] 

+ Z [(7r* p u p )(X,Y)} -(n* p e> p )([X,Y],Z) 

+ (tt> p ) ([x, z], y) - fr; Up ) ([y z], x) 
ii[OT + y[ 7 (i)] + zKr)] 

+ j([X, Y\) + (£x7 - izda)(Y) + a([y Z]) 
= X \fl(Z)] + Y [ 7 (X)] + Z [a(Y)} + 7 ([X,Y]) - 7 ([X,Y]) 

+ x [7(F)] - z [a(y)] + y [a(Z)] + a ([Z, y]) + a ([y, Z]) 
= x [/3(z)] + y [ 7 (x)] + x [ 7 (y)] + y [a(z)] 

= X [/3(Z)] + y [ 7 (X)] - X [/3(Z)] - y [ 7 (X)] = 0, 

where we used the fact that -f(X) + a(Z) = and 7(y) + j3(Z) = (this follows directly from (X, a), (Y, /?), 
(Z, 7) £ r(.D)). Thus, 7r*dw p = d(7r*a; p ) = and, because ir p is a surjective submersion, it follows that 
dcj p = 0. Therefore, lu p is a well-defined presymplectic form on M p . □ 

Recall that, since DflX 1 is assumed to have constant dimensional fibers, one can build the reduced Dirac 
manifold (M, D rc( j) as in The following theorem gives the relation between the reduced manifold M and 
the reduced manifolds M p given by the optimal reduction theorem. 

Theorem 6.2 If m £ 3^ 1 (p) Q M, the reduced manifold M p is diffeomorphic to the presymplectic leaf N 
through 7r(m) of the reduced Dirac manifold (M,D rc( j) via the map Q : M p — > N, Tt p (x) >—> (71- o i p )(x). 
Furthermore, Q*lu^ = lo p , where LOpj is the presymplectic form on N . 

Proof: First of all, we will show that the distribution Dq is spanned by G-sections that "descend" to M. 
Let X be an arbitrary section of Dq. Then we find a £ r(V°) such that (X, a) £ T(D (~l % ± ). Since G acts 
on M by Dirac actions, we have (£^ M X, £$_ M a) £ T(D) for all £ 6 0- If r] £ 0, we get 

(%,«)(ltf) = £t M {a{VM)) - a{[£M,VM]) = £^ M {0) + ot([C,rj\ M ) = 0, 

because a annihilates the infinitesimal generators of the G-action. Hence, we have (£^ M X, £^ M a) £ T{D n 
% ) and consequently £% M X £ Y{fDa). Now, if V is an arbitrary section of V, it can be written V = 
Yli=i fiCnf wnere fi, ■ ■ ■ , fk are smooth locally defined functions on M and , . . . , £ fe } is a basis for g. 
Therefore, [V,X] = £* =1 /ii^X - £* =1 x iMli e T(V G + V). Thus we have [T{V G ),T(V)} C T{D G + V) 
and, since Dq = ni(D HOC- 1 ) is the image of the vector bundle D n it is locally finite. Proposition lA.il 
then guarantees that Dq is spanned by sections X satisfying [X, T(V)] C T(V). A vector field satisfying this 
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condition will be called descending in the following, because, using [A~3l we can write each such vector field 
locally as a sum X = X G + X v with X G £ X(M) G and X v G T(V). 

Hence we can choose U C Af open with m G U and smooth local descending vector fields X\,... ,X r , 
such that for all q G U we have 23g( ( ?) = s,p&n{Xi(q), . . . ,X r (q)}. Let F be the set of all such sections 
of T>c- Then F is an everywhere defined family of descending vector fields on M that span the integrable 
generalized distribution Dq. Denote by Ap the pseudogroup of local diffeomorphisms associated to the flows 
of the family F, i.e., 

Ap = {1} U {F t \ o • • • o F£ | n G N and F£ or (F t n n flow of X" G F}. 

This pseudogroup is integrable, that is, its orbits define a generalized foliation on M. The leaves of this 
foliation are also called the accessible sets of Ap. Since Dq is also integr able, th e access ibl e sets of Ap are the 
i ntegr al leaves of Dq. The proofs of these statements can be found in [Stefan (|l974alibh : lOrtega and Ratiul 



(2004J) gives a quick summary of this theory. 

Now we turn to the proof of the claims in the theorem. We begin by showing that the map is well- 
defined. Let 1,1/6 3~ l (p) be such that n P (x) = ir p {y). Then there exists g G G p C G such that = y 
which implies that & g (i p (x)) = i p (y) and ir(i p (x)) = n(i p (y)). Thus, it remains to show that ir(i p (x)) G iV. 
Since a; G 3 _1 (p) and the integral leaves of Dq coincide with the A^-orbits, it follows that i p (m) and i p (x) 
can be joined by a broken path consisting of finitely many pieces of integral curves of G-equivariant sections 
of Dq. To simplify notation, we shall write in what follows simply x for i p (x) and m for i p (m). Assume, 
without loss of generality, that one such curve suffices, i.e., that x = F x (m), where F x is the flow of a 
descending vector field X G L(D G ). Since [X G ,X V ] = 0, we have F t x = F x ° o F t xV = F x ~* o F x ° , where 
F t x and F t x are the flows of X G and X v , respectively. Let F be the flow on M induced by F x , i.e., 
7r o F s x = iro F x o F s x = 7r o _F S X = F s o 7r for all s. This flow F generates a vector field X on M such that 
X X. Since X is a descending section of tt\(D n DC ), we know by the definition of the reduced Dirac 
structure that X G L(Gi). Now we have Ft(ir(m)) = 7r(Ft(m)) = 7r(a;) which shows that ir(x) and 7r(m) lie 
in the same presymplectic leaf N of (M, F re d). This concludes the proof that : M p — > iV is well defined. 

To prove that is injective, let 7r p (j;), n p (y) G M p be such that 7r(x) = 7r(y). Then 1,56 3^ 1 (p) and there 
exists j £ G satisfying $ 9 (ar) = y. This shows that g £ G p and = y, so we get ir p (x) = n p (y). For 

the surjectivity of choose ir(x) G iV and assume, again without loss of generality, that ir(x) = F x (ir(m)), 
where X is a section of Gi and F x is its flow. Choose a vector field X G r(7Ti(D PlDC- 1 )) such that X ^ X. 
Then the flow F x of X satisfies 7r o F s x = F s x o ir for all s and restricts to If we define x' = F t (m) 

we get 7r(x') = F x (ir(m)) = ir(x) and hence 0(7r p (a;')) = n(x). Note that we have simultaneously shown 
that tt(3~ 1 (p)) C M is equal as a set to iV. Moreover, we claim that the topology of N (which is in general 
not the relative topology induced from the topology on M) is the quotient topology of 3~ 1 {p), that is, a set 
is open in N if and only if its preimage under w\g-i^ is open in 3~ 1 {p)- 

The topology on N is the relative topology induced on TV by a topology we call the Gi-topology on M: 
this is the strongest topology on M such that all the maps 

U -» M 

(h,...,t k ) - (F t fo..-oF t f )(m) 

are continuous, where m € M, F t Xi is the flow of a section Xj of Gi for i = 1, . . . , k, and {/ C is an 
appropriate open set in R fc . In the same manner, because 3~ 1 (p) is an accessible set of the family 

{X G X(M) I 3a e fl X (M) such that (X,a) is a descending section of D n DC -1 }, 

the topology on 3 _1 (/o) is the relative topology induced on 3 _1 (/o) by the topology we call the m(D n % ± )- 
topology on M: this is the strongest topology on M such that all the maps 

U -> M 
(*!,...,**) -» (F t ? 1 o...oi£* )(m) 



are continuous, where m G M, F^ is the flow of a vector field Xi on M such that there exists a, G fi 1 (M) 
such that is a descending section Z? n OC 1 - for i = and J7 C M fc is an appropriate open 

set in Now our claim is easy to show, using the fact that for each section X of Gi, there exists a 
descending section (X, a) of D n DC^ such that X ^ X and hence F x o n — tt o F x . Conversely, for each 
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descending section (X, a) of D fl DC , the vector field X satisfying X ~„. X is a section of Gi and we have 
F t x o 7r = 7r o F t . Hence, a map / : N — * P is smooth if and only if / o 7r|<j-i( p ) : 3~ 1 {p) — > -P is smooth. 

Thus, the smoothness of and of its inverse © _1 : N — > M p , 7r(x) i— » 7r p (a;) follow from the following 
commutative diagrams: 



3~Hp) 



M 



M 



iV ■ 



M„ 



Consider the first diagram: since 7r o i p is smooth, we have automatically (by the quotient manifold structure 
on M p ) that o 8 is smooth. Since N is an initial submanifold of M, the smoothness of O follows. With 
the considerations above and, because ir p is smooth, we get the smoothness of _1 with the second diagram. 

Now we show that is a presymplectomorphism, i.e., Q*lu^ = uj p . Let ir p (x) G M p , x G 3~ 1 (p), and 
v, w G T x 3- 1 (p),_le., we have T x i p v, T x i p w G V G (i p (x)). Then find 1,7 6 T (Gi) and aje fi^M) such 
that (X,a), (Y,/3) G r(£) red ), 7^(0 o tt p )v = T x (tt oi_ p )v = X(w(i p (x))), and T X (Q ott p )w = T x (ir o i p )w = 
Y(n(i p (x))). Choose X,Y £ X(M) such that I ~, I, 7 ~ w F, and (X, -rr*a), (Y, n*(3) G F(L> C\X ± ). Then 
we get 

(0*Wjv)(7r p (a;)) (T x ir p v, T x n p w) = uj^ ((0 o % p )(x)) (T X (Q o 7r p )w, T X (Q o 7r p )w) 

= WAr((7r o i p )(x)) (T x (tt o i p )v, T x (ir o i p )w) 
= cj R ((n o i p )(x)) (X(n(i p (x))), Y(n(i p (x)))) 
= a((n o i p )(x)) {Y(n(i p (x))) = a(i p (x)) (Y(i p (x))) 
= ujp(iip(x)) (T x ir p v, T x ir p w) , 

where the last equality is the definition of uj p . □ 



6.2 Induced Dirac structure on a leaf of T> G 

In order to check the power of the Optimal Point Reduction Theorem 16.11 we shall implement it in the 
case of the trivial symmetry group G = {e}. For this and also for the reduction of dynamics in the next 
subsection, we need to describe the induced Dirac structure on a leaf 3~ 1 (p) of Dq- Of course, we could use 
the fact that since 2) G is a subdistribution of Gi, each leaf of Dq is an immersed submanifold of a leaf of Gi. 
Knowing this, the induced Dirac structure on a leaf of Dq is the graph of the pullback of the presymplectic 
2-form on the corresponding leaf of Gi. But we want to get the stratification in presymplectic leaves of M 
as a corollary of Theorem 16. li and so we have to derive directly the induced Dirac structure on 3~ 1 (p) from 
the definition of the map 3, which is what we do next. 

Let i p : 3~ 1 {p) °->Mbe the inclusion. Define the smooth 2-form on 3~ 1 (p) by 

w a - l(p) (m)(X(m),F(m)) = a m (Y(m)) = -(3 m (X(m)) (43) 

for all X, Y G (/£>)) and m G 3~ 1 {p), where X, Y G r(D G ) are such that X ~ ip X and Y ~; p Y and 

a, (3 are sections of V° such that (X, a) and (Y, /3) G T(_D n 3C ). Note that in the proof of the closedness 
of w p , we have shown that u>g-i( p ) — tt*uj p is a smooth closed 2-form on 3 (p)- 
The induced Dirac structure on 3~ 1 (p) is given by 

t(zvmp)) = {(x,a)er (Tr x (/0 © ^rV)) | i^a-w = . 

Let (X,a) G r(L> 3 -i (p) ). Let X G X(M) be such that X G T(2) G ) and X ~ ip X. Hence, there exists 
a G F(V°) such that (X,a) G r(D) . Then, choosing for each Y G 3L(d~ l (j>)) a section Y G r(D G ) with 
f ~ ip Y and a section (3 G r(V°) such that (F,/5) G r(L>), this yields 

a(m)(Y(m)) = {i^ s -i {p) ){m){Y (m)) = a(m)(Y(m)) = (i*a)(m)(Y(m)) 
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and we get a = i*a. 

Choose now an arbitrary a' £ f2 1 (M) (not necessarily in r(V°) ) such that (X, a) G r(£)). We get 
a — a' G r(Po) and hence a(Y) = a' (Y) for all Y G T(Gi). In view of the considerations above, this yields 
a = i*oi. Now recall that each X G X(M) satisfying X ~j p X is necessarily a section of Dq. We have 
proved the following result. 

Proposition 6.3 The induced Dirac structure Dg-ir p \ is given equivalently by 

Dg- Hp) (m) ={(X(m),a(m)) G T m r\ P ) (BT^ip^X e X(3~Hp)), 
a G Q 1 (3~ 1 {p)), there exists X G X(M) and a G Vt x (M) 
such that a = i* p a, X ~ ip X, and (X, a) G T(D)\ . (44) 

This formula was found by iBlankenstein and van der Schaftl (1200 lh in the case of submanifolds. The 



proposition above extends it to the important case of the level sets 3 1 (p) of the optimal momentum map 
3 which are only initial submanifolds. 

Now we go back and apply Theorem 16. II to the case G — {e}. This condition implies that Dq = Gi and 
so the leaves of the generalized foliation are the presymplectic leaves 

N := M p = 3-\ P )IG p = 3-\p)/{e) = r X (p). 

Thus, if G = {e}, the presymplectic form con given in (|43[) is equal to to p in the Optimal Point Reduction 
Theorem 16.11 Hence the Dirac structure on the presymplectic leaf N is given by 

D N = {(X, a) G T(TN © T*N \ i^N - «} 

= a) G T(TN © T*N) \ there exists X G X(M) and a G Q X (M) 

such that a = i* N a, X X and (X, a) G r(D)| 

where ijv : N — ► M is the inclusion. This is exactly the induced Dirac structure given by © . 

Thus, the theorem stating that each closed Dirac manifold has a generalized foliation by presymplectic 
leaves, each leaf having the induced Dirac structure, is the trivial case of the Optimal Point Reduction 
Theorem 16.11 

Note that Theorems 16.11 and 16.21 extend this result by characterizing the presymplectic leaves of M if 
G^{e}. 

Remark 6.4 Assume that M is a Poisson manifold and D is the graph of the induced map j) : T*M — > TM . 
We sh all prove t hat the reduced spaces M a are symplectic manifolds, in agreement with the result of lOrtega 



( 20021 ) (see also Ortega and Ratiul (|2004h . Theorem 9.1.1). Indeed, the reduced distributions G(J are all 



trivial. To see this, recall that Gq/V p descends to G(J, where G(J is the distribution defined by the Dirac 
structure on 3 _1 (p). Let I be a section of Gq, i.e., there exists X G T(T>g) and a G r(V°) such that 
X X, i*a — and (X, a) G T(D). Choose an arbitrary section j3 of V° . Since D is the graph of the map 
jj associated to the Poisson structure, Pi = TM and there exists Y G T(T>a) such that (Y, (3) G T(D). Let Y 
be the vector field on 3~ 1 {p) such that Y Y . Then (Y, i*/3) is a section of D$-xt p \ and we can compute 
for all m G 3 1 {p)'- 

f3 lp{m) (X(i p (m))) = (i* p f3) m (X(m)) = (i* p a) m (Y (m)) = 0. 
Hence, we have X(i p (m)) G (Dq D V)(i p (m)) and hence X(m) G V p (m). A 

6.3 Reduction of dynamics 

In this subsection we study the dynamic counterpart of the geometric Theorem 16. II 

Definition 6.5 Let Dh C TM denote the affine distribution whose smooth sections are the solutions of the 
implicit Hamiltonian system (X, dh) G r(D) for an admissible function h G C°°{M), i.e., the vector fields 
X G X(M) satisfying (X,dh) G T(D). 
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If Xh is a solution of the implicit Hamiltonian system (X, dh) E T(D), then T(Dh) = Xh + r(Go)- Indeed, 
if X is another solution, we have (X, dh), (Xh, dh) G T(D) and thus (X — Xh, 0) G T(D) which is equivalent 
to X — Xh E r(Go). This shows that T>h C TM is really a smooth affine distribution. 

If h G C°°(M) G and X h G T{V h ), we have A h G T(D G ) since d/i G T(V°). This also shows that T) h C D G . 
According to the considerations in £ )6.2[ if X/, G 3E(jJ -1 (p)) satisfies .X/, ~i p Xh then we necessarily have 
(X h ,d(i* p h)) E T(Dg-i^). Hence X h + r(Gg) = r(D ftoip ), where Gg is, as above, the distribution defined 
by the Dirac structure on 3~ 1 (p)- Denote by Gg the smooth distribution associated to the Dirac structure 
on M p ; hence Gg is the kernel of the presymplectic form u> p . 

Theorem 6.6 Assume that Gg is a locally finite smooth distribution on 3~ 1 {p)- 

(i) Let h E C°°(M) G be an admissible function. The flow F t of each Xh E T(T>h) leaves 3~ 1 (p) invariant. 
The set T>h is G -invariant in the sense that = Dh for all g E G. Therefore, the set D/ l0 i p C 
T3~ 1 (p) is G p-invariant. 

(ii) The affine distribution T>hoi p Q T3~ 1 (p) projects to an affine distribution T>hoi„ on M p . 

(iii) Since h E C°° (M) G , the function h p given by the equality h p oir p = hoi p is well defined. It is admissible 
and the corresponding affine distribution D p h is equal to T>hoi p ■ 

(iv) Let k E C°°(M) be another admissible G -invariant function on M and {■, -} p the Poisson bracket on 
admissible functions associated to the Dirac structure D p on M p . Then ({h,k}) p — {h p ,k p } p , where 
{ , } p is the Poisson bracket on admissible functions on M p . 

The proof of (ii) requires the technical result given in Proposition IA.1I of the appendix. 

PROOF: (i) The first statement is obvious: since (X h ,dh) G T(D) and dh E T(V°) we have (X h ,dh) G 
T(D n X^) and hence Xh E F(Dg) which implies that the flow of Xh leaves the leaves of the generalized 
foliation defined by the distribution Bg invariant. However, these leaves are precisely the level sets 3~ 1 (p)- 

To prove the second statement, we first show that + Y) E Xh +T(Go) for all g E G and Y E T(Go). 

Indeed, we have ($*(X h +Y), $*d/i) G T(D). Since h is G-invariant, this yields (<f>* g (X h +Y), dh) E T(D) and 
consequently ($£ (X h + Y)-(X h + Y), 0) G T(D). Thus we have $* (X h + Y) EX h + Y + T{G ) = X h +T(G ). 
This shows that $ g (CD/,) C T>h for all g E G since r(D^) = Xh +r(Go). The reverse inclusion is obtained in 
the following way: X h + T(G ) = ($* o + r(G )) C $*(X ft + r(G )). 

To prove the third statement we note that for all g E G p we have i p o <fr p g = $ 9 o This easily implies 
that ®Z*Xh ~ ip &*X h and $g*d(/i o i p ) = j*($*d/i) = i*dh. The last statement follows now by repeating 
the method of the proof above. 

(ii) For all £ G Q we have (£% M Xh, £^ M dh) = (£^ M Xh, 0) G T(D). Because £g-i( p ) is i p -related to £m for 
all £ G q p , we conclude that £^ i _ 1 Xh is i p -related to £^ M Xh and with the formula (14"4")) for Dg-i^, this 
yields 

(W^tnw^) = K-x w ^. A,-i w (»W) = (£ £s _ Hp) X h ,i;(£, M (dh))) 

= (£ Ca _ lw x h ,o)er(i? a -x (p) ). 

Thus, £^_ 1 X h E r(Gg) and the inclusion [X h ,T(V p )] C T(V p + Gg) follows. Furthermore, since Gg = D 

with G C°°{d~ x {p)) G , an analogous argument shows that [r(Gg), T(V P )] C T(V p + Gg). This shows that 
all hypotheses of Proposition lA.il are satisfied for the involutive subbundle V p of T3~ 1 (p) and the locally 
finite generalized distribution Gg. Thus there exist Z E r(Gg) and Xh E 3C(M p ) such that Xh + Z Xh- 
In addition, Proposition IA.1I ensures the existence of spanning vector fields Z\,. . . ,Z^ for T(Gg) such that 
Zi ~7r p Zi for vector fields Z\, . . . , Z^ E X(M p ), i = 1, . . . , k. Hence T)hoi p projects to the affine distribution 
T> h oi p on M p defined by T(f> h oi p ) = X h + span c=0(Mp) {Zi, . . . , Z k }. 

(iii) We continue to use the vector fields defined in the proofs of the first two statements. First show that 
hp is admissible, i.e., (Xh,dh p ) E T(D p ). To see this, note that for m G 3 _1 (/o) and v = Y(m) E T m 3~ 1 (p), 
we have 

(i^) (7r p (m)) (T m n p v) = {■K* p u p ){m){{X h + Z)(m),Y(m)) = (^d/i)(Y)(m) 

= (7r* p dh p )(Y)(m) = (dh p ) (7r„(m)) {T m v p v) 
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Thus, we have uj p — dh p and hence (Xh,dh p ) G T{D p ) which shows that h p is admissible. By an 
analogous argument with h p replaced by the zero function on M p (and h by the zero function on M) we get 
that Zi G T(Gq) for i = 1, . . . , k, and hence that T>hoi p Q I>£ • Denote Xh — Xh p , since Xh is a solution of 
the implicit Hamiltonian system (Xh,dh p ) G T(D p ). 

For the converse inclusion, it is sufficient to show that Xh + G{J C T>hoi p - So we need to prove that if 
Y G r(Gg), then X hp +Y <E T(T> holp ). Indeed, since Y G T(Gg) C X(M), there exists Y G such 
that Y ~ np Y. The existence of Y G r(D G ), a G r(V°) with Y ~ ip Y~ and (V,a) G T(D) follows by the 
construction of the leaf For all m G 3 _1 (/o) and u G Dg(wi) C T m M we get 

a(m)(u) = w a -i( p )(rn)(i > (m),w) = (7r*w p )(Y(m), w) = w p (7T p (m))(Y(7r p (m)), T m ir p (v)) =0 

where the last equality holds because Y G T(Gq). Now (|44p leads to (y,i*a) G r(Dg-i( p )) and from the 
computation above we conclude hence that i*a = 0. Therefore Y G T(Gq). Since Y + Xh + Z ^^ p Y + X] lp 
the assertion is shown. 

(iv) This last statement is a straightforward computation which follows from the considerations above. 
Indeed, for all m G d~ 1 (p) we have 



({h, k}) p (7r p (m)) = ({h, k} o i p )( m ) = (dh) (i p (m)) (X fc (i p (m))) = (*;dA)(m) (x fc (m; 

= {n;dh p ){m) (l fe (m)) = (d/i p ) (7r p (m)) (t„^ p (l fe (m))) 
= (-dfcp) (7r p (m)) (X hp (7r p (m))) = {/i p ,/c p } p (tt p (to)), 

where the sixth equality follows from T m 7r p {X^{m)\ G Dk p (m) which holds since Xf-( m ) £ ^koi p (jn)- O 



7 Optimal reduction for nonholonomic systems 

Recall the setting of Q is a configuration space which is a smooth Riemannian manifold, D C TQ is 
the constraints distribution given as the intersection of the kernels of k linearly independent 1-forms on 
Q and is hence a vector subbundle of TQ, L is a classical Lagrangian equal to the kinetic energy of the 
given Riemannian metric on Q minus a potential, M := ¥L(T>) C T*Q is a submanifold and represents the 
constraints in phase space T*Q, and u>m '■— **w can G il 2 (M) is the induced 2-form on M, where i : M T*Q 
is the inclusion and w can the canonical symplectic form on T*Q. The distribution J£ := TMC\ (T7rT*Q) _1 (D) 
is not integrable but has the property that the restriction lo<k of ojm on J£ x is nondegenerate. The Dirac 
structure D associated to this nonholonomic system has fibers 

D{m) = {(X(m), a m ) G T m M © T^M | X G r(Jf), a - i x w M G T(J{ )} 

for all m G M and is, in general, not integrable. Recall from Proposition ^. 2f i) that Go = {0} and Pi = T*M 
and hence all functions are admissible. 

Consider a G-action cj) : G x Q — > Q on Q that leaves the constraints and the Lagrangian invariant. The 
lift $:Gx T*Q — > T*Q of the action is defined by <& s = {T<j> g -i)*\ this is a symplectic action on T*Q that 
leaves M invariant. Thus we get a canonical G-action on the Dirac manifold (M, D) and we have for all 
9 G G, 

since the G-action commutes with the inclusion. Note that in this section the G-action on T*Q is a lift, 
whereas in []4]we needed only that it is a symplectic action. 

In this section we shall define a distribution on M that yiel ds the equations of mot ion and the conserved 
quantities gi ven by the Nonholonomic Noether Theorem (see ICushman et al. f|l995h . Theorem 2 and also 



Blochl (|2003l) . Chapter 5 and the corresponding internet supplement). If this distribution is integrable, we 



will prove a Marsden-Weinstein reduction theorem that gives a reduced Dirac structure which is the graph of 
a nondegenerate 2-form (not necessarily closed) . This reduction procedure is done from an "optimal" point 
of view as in $6l although these to sections are completely independent of each other. 
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7.1 The nonholonomic Noether theorem 



We recall in this subsection the Hamiltonian formulation of the Nonholonomic Noether Theorem. Let J 



T*Q — > g* be the canonical momentum map associated to the action of G on T*Q (see, e.g-. lMarsden and Ratiu 



(1999)) 



J(p)(0 = <P.fo(*(p))> (45) 

for all p E T*Q, where ir : T*Q — > Q is the projection. For all £ E g, the ^-component of J is the map 
J« : T*Q -> R is defined by 

J f (p):=(J(p),e> (46) 

for all p E T*Q, where (■ , ■) is the natural pairing between elements of g* and elements of g. We shall denote 
by the same symbol its restriction to the manifold M. For an arbitrary £ E g we have therefore 

k-r'Q^n = dJ«. (47) 

Since the action of G on T*Q leaves the submanifold M invariant, we have £t*q(w) E T m M for all m E M 
and hence the fundamental vector field £t*q is «-related to £m> i-e., o £ M = £y»Q o i, where i : M "^-> T*Q 
is the inclusion. Choosing for each vector field X E X(M) an arbitrary extension X' E X(T*Q) (and hence 
X ~i X') we get for all m £ M, 

i^ WM (X)(m) = if^i'MWW = i CT , Q o; cail (X0(i(m)) - (dJ«(*'))(*(m)) 
= (i*dJ«)(X)(m) = (dJ«(AT))(m) 



which shows that (|47|) naturally restricts to M 

i eM w M = dJ^. (48) 
Define for all p E M the vector subspace g p := {£ E g | £m(p) £ (V n IK)(p)} Cg. Then 

M := U S P 

peM 

is a smooth (not necessarily trivial) vector subbundle of the trivial bundle M x g if and only if IK + V 
has constant rank on M, for instance if IK + V = TM. Indeed, note first that g M = A _1 (V n IK), where 
A : M x g — > V is the vector bundle isomorphism over M given by A(m,£) := £,M(m). However, since !K + V, 
IK, and V are subbundles of TM, it follows that VnlK is also a subbundle of both TM and V. Consequently, 
g^ = A _1 (V n IK) is a subbundle of the trivial vector bundle M x g. Thus, if g M is a vector bundle over M, 
then V(~l IK is also a vector bundle and hence its fibers have constant dimension on M. It follows immediately 
that the rank of IK + V is also constant on M. 

For the rest of this subsection we assume that !K + V has constant rank on M and hence that g^ is a vector 
subbundle of the trivial vector bundle M x g. If ^ is a smooth section of g M , then := (£ M (p)) m (p) 
defines a smooth section of V D IK. Conversely, if {£} , . . . ,£ fc } is a chosen basis for the Lie algebra g, then 
the vector fields . . . are global vector fields on M that don't vanish and are everywhere linearly 
independent. Hence, £\ 4 , . . . , £^ are smooth basis vector fields for the bundle V. Every section £ of V n IK 
can hence be written £ = y'., /j^l^ with smooth (local) functions /i, . . . , and corresponds exactly to 
the section = £* =1 of g w . 

Note that since V n IK is a subbundle of TM it is a locally finite smooth distribution. Hence, since we 
have 

[r(v n ik), r(V)] c r(V) = r((V n IK) + V), 

we get with Propositions ! A. ll and I A. 2l for each p E M there exists a neighborhood U of p and G-equivariant 
spanning sections of V n IK on U. 

Let £ M be a smooth section of g . For all p E M and all X E X(M) the definition of the corresponding 
£ and (08]) yield 

^m(p) «(p), *(p)) - dJ«^>(p) (X) . (49) 
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As above, write ^ — ^2 i=1 fiC with smooth functions fi, ■ ■ ■ , fk and the chosen basis {£ 1 , . . . , of g. 
Define the smooth map 



Using (g5|) and (JUJ) we get 



J« M : M -» K 

p ^ j^w(p) = (j(p),e :K (F))- 



If c : (— e; e) — » M is a solution curve of a vector field X G X(M) with c(0) = p, we have 



J £ "(<«))= A 
t=o ar 



ft 



5>w*))j*>(*)) 



4=0 f= i 
fc 



= £ d/i(p)(c(0))J** (p) + ]T /i(p)dJ c " (p)(6(0)) 
t=l t=l 

= jEf^d/iWW^) +dJ« sc W(p)(X) 
= J*[«%) + dJ^>(p)(X), 

where we write X [^j := £* =1 Tllus © becomes for all p G M and all X G X(M), 

^m(p) (*(p),*(p)) = dJ« M (p)(X) - J x t^](p). 

Hence, if the one form cfi G il 1 (M) is defined by 

afi(X) :=dJ^(X)-J x [t K ] 

for all X G X(M), we have i^ljm = & and so the pair (£, a*) is a section of D. 

Let h be a G- invariant Hamiltonian and Xh G L(!K) the solution of the implicit Hamiltonian system 
(X,dh) G r(£>). Then 

dJ«" - J^[«"] = ofi(X h ) = u M (t, X h ) = -dh{£) = 

since d/i(£)(p) = (d/i(p),£(p)) = (d/i(p), (£ W (p)) M (p)) = by G-invariance of ft,. Thus, we have proved the 
following result. 

Theorem 7.1 Let £ 6e a section of and Xh G L(3f) i/ie solution of the implicit Hamiltonian system 
(X,dh) G r(D), where h is a G-invariant Hamiltonian. Then Xh satisfies the Nonholonomic Noether 
Momentum Equation; 

djt X (X h ) - J Xfe [« M ] = 0. (50) 

Recall from ([^j) and that Jt is defined in terms of the given Lagrangian L : TQ — > K and hence, only 
the dynamics defined by the corresponding Hamiltonian H is the object of interest. Thus, for each other 
Lagrangian L' we obtain another distribution IH'. 

Remark 7.2 In iBlochl (|2003h . Theorem 5.5.4, the Nonholonomic Noether Theorem is formulated in terms 
of a Lagrangian of a classical mechanical systems (hence equal to the kinetic energy of a metric minus a 
potential). Let Vq C TQ be the vertical subbundle of the action : G x Q — > Q. Under the Dimension 
Assumption D + Vq = TQ, the distribution D[~iVq is a smooth subbundle of TQ. Note that this assumption 
leads automatically to J + V T * Q = TT*Q and hence to IK + V = TM. 
The smooth vector bundle 0° := [j peM 0° (q) is defined pointwise by 

Q^iq) :={£e fl |6 3 (g)e(V Q nD)(g)}c fl . 
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Let (g 25 )* be the dual bundle, that is, its fibers are (g )*(<?) := (g (?))* for all q E Q. The nonholonomic 
momentum map J nhc : TQ -> (g 15 )* is the vector bundle map over Q defined by 

<9L 



(J nllc K) J = {FiM,e Q (3)) 



;(£q) 1 (<?) =: J nhc (OK) 



where £ S 1> (<z)- Let £° be a section of the bundle g D . Theorem 5.5.4 in iBloch ( 2003f) states that any 
solution c(t) = (q(t),q(t)) of the Lagrange-d'Alembert equations for a nonholonomic system must satisfy, in 
addition to the given kinematic constraints, the momentum equation 



dt 



J nhc (em)) ft*)) 



dL 



dt 



(em) 



(51) 



Since for all £ G g the vector field £t*q is the cotangent lift of £q, we have in local charts 



d dC c 



d 



Hence, if £g(g) G £>(g), we get ir*Q( a q) S ^(ojg) for all a 9 G T*Q (see (|2"3")l for the definition of J). 

Consequently := (C 1) ( 7r T*Q(p))) M (p) for all p G M defines a smooth section £ of V n "K and hence 
a smooth section £ M of g M . Note that £ M (p) = ^(7t t .q(p)) for all p G M and if g B = £)* =1 /*£q witn 
smooth functions fi, . . . , then g^ = Fi^ l M with the smooth functions fj defined by F{ = i^jir^gfi, 

where %m ■ M T*Q is the inclusion. Let Xh be a solution of the implicit Hamiltonian system (X, dH) G 
L(Z?), where is the G- invariant Hamiltonian on M associated to the Lagrangian L by the Legendre 
transformation, and p(t) an integral curve of Xh- Then c(t) := (q(t),q(t)) = (FL) _1 (p(t)) is a solution of 
the Lagrange-d'Alembert equations. We have for all t 

o=(djf*(x H )-jM^]) ^t)) = p^(p(t)) 



J2 Tt (FMt)))^( P (t)) 

i—1 



dt 



P {t), (e(p(t))) Q («(«))) - ( p(ti ( -iFMt)))e ) (q(t)) 



= - (¥L(c(t)), (e(q(t))) Q (q(t))) (FL(c(t)), ( - (/,(?(<)))£* ) (?(*)) 



Fi(c(t)), (^(«(t))) Q («(«))) - ( Fi(c(t)), ( Jt e (<?(*)) 




p^(e(m)(c(t))-H 



±(em) 



Hence our Nonholonomic Noether Theorem 1 7. II is the Hamiltonian version of Theorem 5.5.4 in lBloch (|2003h . 
that is, ([50|) and (fSTj) are equivalent. A 

Proposition 7.3 Assume that V + J£ = TM. Lei £ &e a G-equivariant section of g™. Then the corre- 
sponding section (£,a^) of-D is aZso G-equivariant. There are two possibilities: 

(i) — ijwjf =0 on V fl M. TTien f/iere ea;is£ a' G r(V°) such that (£, a') is a G-equivariant section of 
D n and exactly one section a G L(Pq cc1 ) such that ir*a = a' . Conversely, each section o/L(PQ° d ) 
pulls back to a section a' defined as above and satisfying this condition. 

(ii) VnJ{^ £ U3M and hence on VflJf. Then leads to a momentum equation that doesn't appear 
in the reduced implicit Hamiltonian system. 

Proof: If £ M is G-equivariant, we have £ (</ -p) = Ad g £ (p) for all p G M and hence for the corresponding 
£ we get using , 

(*J(€))(p) = r fl .p$ s -i€( ff -p) = r„v (^(.9 -p)) m to -p) 



(Ad 9 (^(p))) M ( 5 -p) 



(Ad fl -x o Ad 9 (^(p))) M (p) - (£ M (P)) M (p) = £(p) 
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Note that conversely, if £ is equivariant, then the corresponding section £ of q is G-equivariant. Since 
<&g(JM = ojm for all g G G, the section (£, a*) is G-equivariant. Since V + JC = TM, if a* = on V fl J{ 
there exists as in Proposition I4.2f ii) a unique section (3 G r(JC°) such that afi + p G r(V°) and hence 

, + er(Dn3C x ). 

But since = £> and ^DC 1 ^ = DC^ we have also 

(£, + = + *;/?) g r(D n dc x ) 

for all 5 G G and, because (3 is unique, we get $*/3 = /3. Hence the first statement of (i) holds with 
a' := at + (3. But because £ G T(V), the section of -D rc d corresponding to (£,«') will be (0, a) with 
a G Q^M) such that ir*a — a'. 

On the other hand, if we choose a non-zero section a of Pg ed , the codistribution associated to the reduced 
Dirac structure on M, we have (0, a) G r(D ro( j) and we find X G r(J£) such that X ~„- and (X, ir*a) G 
T(D n DC ). If X = 0, then we have n*a — on H + V = TM, contradicting the fact that a is a non-zero 
section of PQ Cd . Therefore X is a non-zero vector field lying in T(JC (~1 V) with ix^M = on JC fl V. We 
conclude from this that the sections of PQ ed pull back exactly to the G-equivariant sections a^ + (3 G r(V)° 
induced by sections £ of (V n "K) n (V n "KY M . 

If VnDi ^ C M and hence a € ^ on VnJ£, then there is no [3 G r(DT) such that (£, a € + /3) G r(L> nJC 1 ) 
and (ii) follows immediately. □ 

Definition 7.4 We will call Nonholonomic Noether Equation a section at corresponding to a smooth section 
^ o/VnlK. A DC-modified Nonholonomic Noether Equation is a 1-form a' G J1 1 (M) that can be written 
a' = at + p with a nonholonomic Noether equation a^ and (3 G r(J£ ). A Descending (Df-Modified) 
Nonholonomic Noether Equation is a (!K-modified) nonholonomic Noether equation as in Proposition \ 7.S\ 

(i)- 

Note that because of the /3-part of a descending DC-modified nonholonomic Noether equation, sections of 
Pp ed don't pull back exactly to sections a^ associated to sections ^ as in Theorem 17.11 (the nonholonomic 
Noether equations). It is possible that they pull back to one-forms that coincide only on D{ with some a^. 

Proposition 7.5 The codistribution spanned by the Noether equations which descends to the quotient M/G 
is given by 

7r 2 (L>n (v© v )) = (b(vn?f) + J{ )nv (52) 

where b : TM — ► T*M is associated to u>m- 

Proof: We have seen that a descending (IK-modified) nonholonomic Noether equation a' is a G-invariant 
section of V° such that there exists a G-equivariant section X of V n "K with (X, a) G T(D n DC ). So we 
only have to show equality (f52"| . Let a be a section of the left-hand side. Then there exists X G T(V n DC) 
such that (X,a) G T(D n (V © V )) and hence there exists G r(Df°) such that a = i x u; M + (3. Therefore 
a G T ((b(V nM) + Df°) n V°). Conversely, let a be a section of (b(V n 5C) + DC 3 ) n V°; then a = i x co M + P 6 
r(V°) with X G T(V n Di) and /? G T(J{ ). But this means that (X, a) is a section of D n (V © V°). □ 

Example 7.6 We compute £ and a^ for the constrained particle (see i j4.3[) . In this example, Q = R 3 , 
M := {(x,y,z,p x ,p y ,p z ) \ Pz = y Px } C T*Q = M 3 x R 3 , and V n 5£ = span{d x + If := (1,0) 

and £ 2 := (0, 1) is the chosen basis of g = R 2 , then (1,0) M = d x and (0, 1) M = 9* so that b (w>p*<Pv) : = 
span{(l, 0) + y(0, 1)} is the fiber of the vector bundle q k at the point (a;, y, z,p x ,p y ) G M. Therefore, any 
section ^ of has the form (x,y, z,p x ,p y ) = f(x, y, z,p x ,p y ) ((1, 0) + y(0, 1)), where / G G°°(M). 
Consequently 

£(x,y,z,p x ,p y ) = (£, :H (x,y,z,p x ,py)) M (x,y,z,p x ,py) = f(x,y,z,p x ,p y )(d x + yd z ). 

The components of the momentum map J : T*Q — > Q* are J^ 1 ' ) = and J^ 0,1 ^ = p z so that the 
restrictions to M of these functions are J^ 1 ' ' = p x and J^ ' 1 ) = yp x . Therefore 3^ (x,y, z,p x ,p y ) = 
f(x,y, z,p x , Py )p x (l + y 2 ) and if X G X(M), then X[f(l, 0) + yf(0, 1)] = X[/](1,0) + X[y/](0, 1) and so 

jXmm+yM,!)] = x[f]px + X [yf]yp x =p x (l + y 2 )X[f] + y Px fX[y}. 
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The 1-form on M which applied to X yields the right hand side is p x (l + y 2 )df + p x fdy and hence 

a*(x,y,z,p x ,p y ) = dJ« (x,y,z,p x ,p y ) - p x (l + y 2 )df{x, y, z,p x ,p y ) - p x f(x,y,z,p x ,p y )dy 
= f(x,y,z,p x ,p y ) ((l + y 2 )dp x +yp x dy) . 

So the section spanning the codistribution Pg ed in this example is (1 4- y 2 )dp x + yp x dy, as (f3"2"j). It is easy 
to see that in this case V C (V n 34)" M and hence the nonholonomic Noether equation descends to the 
quotient. 



7.2 The reaction-annihilator distribution 

An important problem is to decide when the Nonholonomic Noether momentum equation gives a constant 
of motion rather than an equation of motion. We have to distinguish between two cases: 

(i) The section ^ is constant, i.e, = £ for all p 6 M, where (eg. Then £(p) = £m(p) and so 

we have a£ — dJ^, so is a constant of the motion. We will see below that sometimes one can find 
r\ G g such that 3 V is a cons tant of moti on for all solutions of G-invariant Hamiltonians, but rjM is not 
a section of V n 34 (see also iFasso et all ( 20071 )). 



(ii) The other case is that of gauge symmetries, that is, non-constant sections of g that yield constants 
of motion (see Fasso et al. ( 2007t )). Note that if ^ — J2i=i fi£i then it leads to a constant of motion 



if one of the corresponding forms + (3 is exact, that is, we can find / G C°°{M) such that df 
f3 + Yli=i /idJ(£i). However, we do not know of any other characterization of the section so that the 
momentum equation gives constants of motion rather than an equation of motion. 

In the reduction method for nonholonomic systems, the first step is to compute the horizontal annihilator 
U of V, that is, the distribution IX = (Vn34) Wju n34 C TM C TT*Q (see flU). We have seen in Proposition 
I4.2f ii) that any section of U corresponds to one section of Df^X^: for each X £ r(U) there exists a G T(V°) 
such that (X, a) G T(_D) and hence a — ix^M G r(34°). So the method of finding a section a G r(V°) 
associated to X G r(U) is the same as determining (3 G r(34°) such that ix^M + /3 =: a G r(V°). As we 
have seen in §4.4( case 3, sometimes not the whole of 34° is needed in this construction. This is why we 
introduce the new codistribution 31 on M whose fiber at p G M equals 

R(p) = {(3(p) | f3 g r(34°) and there is some X G T(U) such that f3 + i x uj m G T(V )} C 34°. (53) 

In general, 3? is strictly included in 34°. 

If h G C°°(M) G is an admissible function, then there exists X h G r(34) such that {X h , dh) G T{D). Recall 
that Xh is unique since Go = {0}. In addition, since dh G T(V°), we have (X h , dh) G T(D n K^) and hence 
Xh G r(lt). Thus, there exists j3 G r(34°) such that dh = ix h i^M 4-/3. This is exactly the Hamilton equation 
for the given nonholonomic system (with the Hamiltonian h) and we have /3 G r(3l), often in terpreted as 



the re action force. In fact 3?° C TM is the analogue of the reaction-annihilator distribution of IFasso et al 



( 20071) 



Proposition 7.7 We have 

b(U) © X = V° + X, 



where b : TM — > T* M corresponds to lum- 



Proof: The sum on the left hand side is direct since if X G r(ll), then X G r(34) and hence ix^M ^ r(34°) 
because wmImxm is nondegenerate. Thus ix^M £ r(3?) C r(34°). Second, recall that Pi = T*M (see 
Proposition so for all a G r(V°) we find X G r(J£) (actually X G T(U)) such that (X, a) G T(D). 

Thus, ^{DDX 1 -) = V°. 

Now we are ready to prove the formula in the statement. If X G r(U), the considerations in H4.2I show 
that there exists (3 G r(JC°) such that i x u M + ft G T(V°). The definition d53j) of X yields directly that 
(3 G T(X). This shows b(U) © 3? C V° + 31. For the other inclusion, choose a G T(V°) and X G r(U) such 
that (X, a) G r(D n 3C 1 ). Then the definition of D yields /3 := a - i x u M G r(34°) and again, using (|53| . 
we conclude that /3 G T(3l). □ 
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The last Lemma leads directly to the equality 

Note that U WM = ((5£ n V) WM n "KY M =(MnV)+ J£<" M since the kernel of oj m lies in W M . 

Now we are abl e to state th e main theorem of this subsection which is the Hamiltonian analogue of the 
main statement of iFasso et al.l (|2007l ). 

Theorem 7.8 Let £ G g. Then the function 3^ is a constant of motion for every G-invariant Hamiltonian 
h if and only if £ M G T(V n 31°). 

Proof: Choose £ G such that £m G r(Vn3i ). We have seen in the preceding section that 1£ m ujm = dJ«. 
For an arbitrary X G T(U) choose /3 G r(ft) with i x Um + =: a G r(V°) and get 

dJ £ (X) = wmKm.I) = /3(6f) ~ a(fa) = 0. 

This yields the statement since for all G-invariant Hamiltonian h the (unique) solution Xh of the implicit 
Hamiltonian system (X, dh) G r(D) is a section of U (with a = d/i the corresponding section of V° and 
/3 = d/i — ix h ^Ai)- For the converse implication, choose £ G such that J £ is a constant of the motion 
for the solution curves of every G-invariant Hamiltonian. Note that since V is an involutive subbundle of 
TM, the exterior derivatives of all G-invariant functions span pointwise V° and hence the corresponding 
solutions span II. This yields dJ^ = on IX. If we choose G T(3?), there exists X G r(U) such that 
i x uJM + (3 e r(V°). Hence we get 

= (i x tu M + 0)(£ M ) = w M (X, £ M ) + 0(fa) = -dJf (X) + 0({ M ) = + 0({ M ) 

and therefore £ M G T(3l n V). □ 

Corollary 7.9 Assume that !H + V /ias constant rank on M. If dJ £ =0 on VflW £/iere exisi /3 G r(!K°) 
and r; e T(V n M) such that a* 1 = dJ« + 0. 

Proof: Since Pi = T*M (see Proposition Ofi)) there exists X G r(5£) such that (X,dJ s ) £ r(D). Hence 
we have dJ« = i x u M + 0' with /?' G T(J{ ) and since dJ« = on It we get X G U UM = (Oi n V) + Jf 1 ^ . 
Write X = V + Y with V G r(JC n V) and V G r(J£ WM ). Since X and V are sections of JC, then so is Y. 
But since "K n K UM = {0}, this yields Y = and hence X G r(5C n V). We find if 1 G r(g :H ) such that the 
corresponding section r; G r(V n 51) is equal to X and therefore (77, dJ £ ) G r(D). We get a v — dJ^ + /3 with 
G r(J£°), a nonholonomic Noether equation corresponding to the section rj^ G T(g^ c ). □ 

7.3 Optimal momentum map for nonholonomic mechanical systems 

In this and the next subsection we assume that Jf + V has constant rank on M. Recall from Remark 14.31 
that this implies that V C\"K and II also have constant rank on M. 

We show in this subsection that, under certain integrability assumptions, it is possible to restrict the 
system to "level sets" given by the nonholonomic momentum equations and then perform reduction. 
Consider the distribution where all ofi + a' defined as in Proposition 17. 31 vanish, namely 

x> G := [M D n ( v © v °))]° ® [00 7 nM) + JT) n v°]° = ((v n J£)<" M n jc) + v = it + v, 

where U := (V n M)""* nM C TM C TT*Q is the horizontal annihilator of V (see p5]) ). Note that 
D G = (V n M) WM + V. Since U C JC it follows easily that T> G n J£ = II + (V n Oi). If D G is integrable, 
its leaves are the level sets of the constants of motion and equations of motion given by the Nonholonomic 
Noether Theorem O for sections £ of (V n K) n (V n < K) UM : the fiber at m G M of the distribution 
7r 2 (£>n(V©V°)) equals 

{a(m) I a G T(V°) and there exists X G T(V) such that (X,a) G r(£>)}. 

Note that if this distribution is spanned by closed 1-forms, hence locally exact 1-forms, then it can be written 
as 

{(d/)(m) I / G C°°(M) G and there exists X f G T(V) such that (X f ,df) G T(D)}. 
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For every to G M we have 

dim [(V(m) n ^(to))^ n (V(m) n JC(m))] = dim [U(m) n (V(m) n JC(m))] = dim [IX(m) D V(m)] 

= dim 1X(m) + dim V(m) — dimD(j(m). 

Recall that IX and !Hn V are vector subbundles of TM. If, in addition, D G is integrable, then its fibers 
D G (m) have constant dimension along the leaves of the generalized foliation determined by TJ G and so the 
computation above shows that the fibers of (V n Ji) U3i n(Vfl M) along a leaf of D G are constant. Thus, the 
same is true for the fibers of T) G n "K = U + (V n %) since UnVnM=(Vn n(Vn3{). We shall use 

this fact in the next subsection where we describe the induced Dirac structure on a leaf. 

In order to restrict the system to the leaves of the distribution Dg and then perform reduction, we have to 
show several statements, the analogues of those needed for the Dirac optimal reduction. Since <&*Wm — ojm 
for all g G G, the proof of the following proposition follows easily. 

Proposition 7.10 The distribution (Vl~)'H) UJM is G-invariant in the sense that 

$* ((vnMD = (vnJfp 

for all g G G. Since V and 3i are also G-invariant, it follows that the distribution Dq = ((Vfl "K ) UM (~l 3i) + V 
is G-invariant. 

If T)q is integrable, define like in §5] the nonholonomic optimal momentum map 

2 : M -> M/T> G . 

We have a result analogous to Lemma 15.41 

Lemma 7.11 If m and m' are in the same leaf of Dq, i.e., if there is a X G T(T)q) with flow F such 
that Fj?(m) — m' for some t > 0, then $ g (m) and $ 5 (m') are in the same leaf o/D G for all g G G. Hence 
there is a well defined action of G on M/Dq: 

$ : G x M/D G -> M/T>g 
®g{3{m))=d{g-m) 

For all p G M/Dq, the isotropy subgroup of p contains G° (the connected component of the identity in G). 

Proof: Let g G G, m,m' G M be in the same leaf of D G , X G r(D G ), and F x the flow of X. For all 
s G [0, t] we have 

^ (4> 9 o Ff) (m) = T F x (m) 0> s (X(F s x (m))) = • (m)) G V G {g ■ F*(m)). 

Hence the curve c(s) = ($3 o (to) connecting c(Q) = Q g (m) and c(t) = $ s (to') lies entirely in the leaf 
of Dq through the point $ 9 (to) and the assertion follows. 

The Lie group G° is generated as a group by the exponential of an open neighborhood of G g. Thus, 
we can assume without loss of generality, that for any g G G° and to G M, there exists some £ G such 
that the curve 7 : [0,t] — * M, 7(3) = $ e xp(s{)( TO )) has endpoints to and g ■ m (in reality, the points to and 
g • to can be joined with finitely many such curves). For all s G [0,i], we have ■j(s) = S,m{j{s)) G D g (7(s)) 
and, arguing as above, we conclude that the whole curve 7([0,i]) lies in the leaf of D G through to. Hence, 
if p = 3(to), the equality $ fl (3(m)) = 3(p • to) = 3(to) proves the statement. □ 

Remark 7.12 The last statement shows that for all p G M/D G , the isotropy subgroup G p is the union of 
connected components of G and is therefore closed in G. This implies that the Lie group G p acts properly on 
the leaf 3 _1 (p). It is obvious that this action is also free. Recall that in the Optimal Point Reduction Theorem 
16.11 the properness of the G p -action on 3~ 1 (p) was not guaranteed. The reason why in the nonholonomic 
case this action is always proper is the inclusion V C D G . A 

Remark 7.13 Note that if the nonholonomic system satisfies 3~C © V = TM, then the bundle XL is given by 
IX = {0} Wjv/ R "K = % and hence D G = IX + V = TM is trivially integrable with the connected components 
of M as integral leaves. Hence, if M is connected, the method of reduction presented in the next subsection 
leads to the same reduced Dirac manifold as the Dirac reduction method of M4.21 A 
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7.4 Optimal reduction for nonholonomic systems 

Restrict the vector subbundles V, J£ := TAfn(T7rT*g) — 1 (D), and It = (Vn3~C)" M fl!K of TM to vector bundles 
V p , Jf p , and Up on the manifold Since the distribution (5£ n (Vn + V) n 5£ = 11+ (Vn !K) C £> G 

is constant dimensional on the leaves of Dg, the Dirac structure on a leaf d~ 1 (p) of 2)g is given by 

D 3 -H P )(m) ={(X(m), a m ) G Tr 1 ^) © T^OO | X G T (U p + (V p n H p )) , 

a-ixw a -i(p) er((Up + (v p nM p )) )} 

for all to G 3 _1 (/o) (see iBlankenstein and van der Schaftl (l200lh h here i p : ^H/ ) ^ M is the inclusion 
and tt>g-i(p) := i*ujm- Since for all p E M/T>g, the isotropy subgroup G p contains G°, the distribution V p 
spanned by the fundamental vector holds of the action of G on 3^ 1 (p) is V p = V| 3 -i( p ). 

Lemma 7.14 Let % p = V p © {0} and Xj = T3~ 1 {p) © V° as m JO] . Then D 3 -i [p) C\%^ is a vector 
bundle over 3~ 1 (p)- 



Proof: Since "K + V has constant rank on the n-dimensional manifold M, recall from Remark 14.31 that 
D fl % ± is a vector bundle on M. We denote r — rank "K, n — r — rank 5{° , Z = rank V° n 3"C° , u = rank U, and 
s = rank(U + (V n !H))\g-if p y Let m G 3~ 1 (p)- As in Remark l4.3[ choose local basis fields Hi, . . . , H r for "K 
and local basis 1-forms (3i, . . . , /3 n _ r for J£° dehned on a neighborhood J7 of m in M. Assume that Hi, ... , i? u 
are local basis helds for If, Hi, . . . , H s , with u < s < r, are basis fields for 11 + (V D "K) on 3~ 1 (p) H E7, and 
ft, ... , A a basis of V°nJ{ = (V + Note that the 1-forms ft,..., A vanish on U + V C J£ + V and 

that A+i, ■ ■ ■ , ft-r don't vanish on It + V (otherwise we would have ft G r(U° nV°nM°) = T(V° n M°) 
for j = i + 1, . . . , n — r, in contrad iction to the choice o f ft, ... , (3 n - r ). The Dirac structure Dg-it p \ is then 
given on [7 n by (see © or iBlankensteml <|2000h ) 



span 



{(Hi,i p i Hl tu M ), (H s ,i* p i Hs u M ), (0,i*/9i+i), . . . , (0, 7pA- r )} , (54) 



where Hi, . . . , H s are vector fields on f/fl 3 1 ( j o) such that £Zj for i = 1, . . . , s. Note that ipiH^M = 

ifri^a-Mp) for i = 1, . . . ,s. 

If {X,a) G D 3 -i (p) n (T3^(P) © V°) then X G T(U P + (V p n H p )), a G r(V°), and ixUg-i^ - a G 
r((Up + (Vp n Mp))°). This is only possible if i x w 3 -i (ri = on 

Vp n (lip + (Vp n Jf p )) = (Vp n u p ) + (v p n oc p ) = l p where £ := [(v n %) n (v n 5£) ww ] + (v n j£). 

We have two different cases. First, if X G T(U P ) then for all m G 3~ 1 {p) and V(m) G £(m) we have 
necessarily 

(ixWg-i( p ))(m)(V(m)) = (i*w M )(m)(X(m), V(m)) = WM(i lS H)(T m j p I(m),T m !/(m)) = 
where we have used 

T m i p x{m) g (Jtn(vn?{r) Mm)), r m i/(m) e H(i p (m)) 

and the definition of L. Hence, for all X G T(U P ) we have ixLOg-i( p ) \c p — and hence we find a G T(V°) 
such that (X, a) G r (Dg-if p \\. Second, for a section X of V n JC that doesn't take values in U p , the 1-form 
ixwg-i(p) doesn't vanish on V p n 5C P and thus neither on L p . Consequently, the sections of D n 3Cp have as 
first component a section of U p . Since for i = I + 1, . . . , n — r we have i*ft ^ r(V°), we get 

DnXj = span <^ \ ki uj 3 -i {p) + ^ a l x i*A ),...,(-&„, i^w g -i (p) + ^ a* i*ft ) I , 

where the functions a* are chosen such that ig-.^g- 1 ^) + S"=i+i a i*p^« are sections of V° for j = 1, . . . ,u 
and i = I + 1, . . . , n — r. Since the vector fields Hi, . . . , H u are linearly independent, we have found basis 
fields for D 3 -ii p \ on U. □ 
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Hence, t he reduced Dirac struc ture D p on 3 1 {p)/G p is given, according to the general considerations in 



( or see 



Bursztvn et"aH (|2007t l) by 



^ p - ^¥{0} / Gp (55) 

The next theorem gives an easier description of this reduced Dirac structure. 

Theorem 7.15 (Nonholonomic optimal point reduction by Dirac actions) Assume that the Lie group 
G acts freely and properly on M by Dirac actions. If Dq is an integrable subbundle of TM then for any 
p G M/Dq we have the following results. 

(i) The orbit space M p = 3~ 1 (p)/G p is a smooth regular Dirac quotient manifold whose Dirac structure D p 
is given by the graph of a nondegenerate (not necessarily closed) 2-form u> p . Denote byi p : S^ip) M 
the inclusion and by tt p : <J -1 (/o) — > M p the projection. 

(ii) Let h G C°°(M) be an admissible and G-invariant Hamiltonian and X^ the (unique) solution of the 
implicit Hamiltonian system (Xh,dh) G T(D). Then Xh G r(U) and we have (Xh\g-ir p -),i*dh) G 

(hi) The flow Ft of Xh leaves 3 (p) invariant, commutes with the G-action, and therefore induces a flow 
Ff on M p uniquely determined by the relation r K p oF t oi p — Ff o n p . 

(iv) The flow Ff is the flow of a vector field Xh p in X(M p ) that is the solution of the Hamiltonian system 
ix hp ^p = d/j p , where the function h p G C°°(M p ) is given by the equality h p o 7r p = h o i p . 

Proof: According to Remark 17.121 the G p -action on 3~ 1 (p) is free and proper. Thus, the quotient 
3~ 1 (p)/G p is a regular quotient manifold and the projection ir p : 3 (fi) — > M p is a smooth surjective 
submersion. We denote from now on by u)g-i^ :— i* p u>M the pull back of lum to 3~ 1 (p)- 
(i) With Lemma EH get 

(Da-HP) HJCp 1 ) +JC P = | ^ a) g r (/ Ta -i (p)/v ^ T*r\p)) X = X(mod V p ) with X G T(U P ), 

a G T(V° p ), and a - i X ^ 3 - Hp) G V ((U p + (V, n < K P ))°)} ■ (56) 

The Gp-quotient of this bundle defines the reduced Dirac structure D p on M p . 
Note that the fibers 

{Up/Vp) (to) := {Up + V p )(m)/V(m) = T m 3- 1 (p)/V p (m), m G 3^{p) 

of the vector bundle Up/Vp, project surjectively to T^ p ^ rn )M p . Like in for each G-invariant X G r(U p ) 

we identify X = X(mod V p ) with the section X of M p such that T7r p o X = X o n p . Write each G-invariant 
a G T(V°) as a — n*a for some a G fl 1 (M p ). 

Next we show that D p is the graph of a nondegenerate 2-form. We begin by giving a formula for this 
2-form u) p . Let X,Y G X(M p ) and choose G-invariant X, F G X(0 _1 (p)) that are 7r p -related to X and F, 
respectively. Write X = X+F and F = F + W with I.Ye T(l[ p ) G and V, W G T(V p ) G . Then X and F are 
also 7r p -related to X and F and we can write, using the existence of a G fi 1 (M p ) such that (X, a) G T(D p ), 

u p (X,Y)=a(Y) = (n*a)(Y) = uj Up (X ,Y), 

since X has to be the (unique) section of U p associated to the 1-form tt*q. (see (|7.14p ) and where uj\l p is the 
restriction of uj$-it p \ to U p x U p . 

We prove that oj p is nondegenerate. Let X G X(M p ) with u p (X,Y) = for all Y G X(M p ). Choose a G- 
invariant section X G r(U p ) as above. Extend X to a local vector field X on M, that is, X G r(U) C X(M) 
satisfies X X. For m G 3 _1 (/c) and u G U(m) C T m M we have 

wj£(m)(X(m),u) = wu(m)(X(m),i;) = uj p (7r p (m)) (X(7r p (m)), T m 7r p (v)) = 0. 
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Thus, the vector X(m) is an element of U(m) = (IK (~l V) W:k (to) that is W3<:(TO)-orthogonal to all v G U(m) 
and hence lies in ((IK n V) WM )" M (m). Since w M is nondegenerate, we have ((IK n V)"' 3 *)"' 3 * = MnV. This 
yields X(m) — X(m) G (IK (~l V)(m) and thus the vector X(m) is zero in T m M p . 

(ii) Recall that, since Go = {0}, the solution Xh of the implicit Hamiltonian system (X,dh) G T(D) is 
unique: if Y is another solution, then Y — Xh G T(Go) = {0m}- 

We know already that X h G T(U p ). Furthermore, we have for all Y G T(U p ), V G T(V p n IK p ) and all 

^- 1(p) (m)(X^(m),y(m) + V(m)) = w M (m)(X h (m),Y(m) + V(m)) = dh m (Y(m) + V{m)) 

= {i*dh){m)(Y{m) + V(m)) 

and the assertion follows. 

(iii) The fact that the flow of Xh leaves 3~ 1 (p) invariant follows from the preceding statement since we 
have X h G T(D G ). By G-invariance of D we have (<&* g X hl $* g dh) G T(D) for all g G G. Since h is G-invariant, 
the equality $*d/i = d$*h = dh holds and thus we have ®* g X h — Xh G T(G ) = {0a/}. The vector field X h 
is consequently G-equivariant and its flow commutes with the G-action. 

(iv) Since X h G T(\i p ) and i*dh G V°, we have 

d/i) g r (D a _i W n (Tr\p) ® vp) . 

The flow F t p on M p induces a vector field Xh p G X(M p ). Therefore, taking the i-derivative of the relation 
in (iii) we get 



F t P Mm)) 



t=o 



d 
dt 



{ir p o F t ) (to) = T m ir p X h (m), 



t=o 



that is, X h ~ 7Tp X hp . Choose Y G X(M p ), Y G r(K p ) G , and V" G T(V p ) G such that Tir p o (y + V) = Y o vr p . 
Then, for all m G we get 

Wp(7r p (m)) (X^(7r p (m)),?(7r p (m))) = (7r;w p )(m)(X h (m), K(m) + V(m)) = (7r> p )(m)(^(m), y(m)) 

= wj- 1(p) (m)(X fc (m),y(m)) = (i*cUi) m (y(m)) = (ir* p dh p ) m (Y(m) + V{m)) = (dh p )„ p{m) (Y(ir p (m))), 

so we have ix hp w P = dh p , as claimed. □ 
7.5 Example: the constrained particle in space 



We return to the example treated in M4. 31 and use the same notations and conventions. The distribution 
V n IK is pointwise the span of the vector field d x + yd z . Since V° is spanned by the covector fields dy, 
dp x , and dp y , the considerations in £|4. 31 yield id x +yd,^M = —(1 + y 2 )dp x — yp x dy G r(V°) and hence 
(d x + yd z , -(1 + y 2 )dp x - yp x dy) G T(D n (V © V°)). Hence the distribution D G is in this case ker{-(l + 
y 2 )dp x — yp x dy} = ker{d/}, where f(x,y,z,p x ,p y ) = \/T + y 2 p x is the constant of motion (in agreement 
with iBates and SniatvckJ ( 19931 ) ). Note that by Example 17.61 df is the 1-form giving the Nonholonomic 
Noether Theorem. Hence 

D G = span{d Py> d x , d z ,yp x d Px - (y 2 + l)d y } 

is obviously involutive and constant dimensional (and consequently integrable). This shows that M /Dq = M. 
The Dirac structure on a leaf / _1 (^t), p G M/T>g = K, of this distribution is given by 

D f -i {p) = {(X, a) G r(TTV) © T*r X (p)) I X G T(IK n D G ), a - i x i*u M G r((5£ n T» G )°)} 

and a computation yields 

D f- 1 M = s P an { ( d Pv > ~ d v) ' (dx+yd z: 0) (0,dz - ydx) , ((1 + y 2 )d y - yp x d Pa! , (1 + y 2 )dp y ) } 

because the 1-form (1 + y 2 )dp x + yp x dy vanishes on T . Since G is in this case connected, we have 

G p = G (see Remark l7.12p . Consider the codistribution V° on /~ 1 (/i) and get 

Df-iM n (T/ _1 (/i) © V°) - span {(d Py , -dy) , (0 X + yd z ,0) ((1 + y 2 )0 y - y P:c 9 Px , (1 + y 2 )dp y ) } . 
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This corresponds exactly to a symplectic leaf (with its associated Dirac structure) of the Poisson structure 
([5^]) obtained in the first part of this example (see <j4.3p . 

Finally we compute H for this example. Since 3~C° is one-dimensional, we get Ji = K° or 3? = {0}. Recall 
that D is the span of 

{(d Py ,-dy) , (d x + yd z , (1 + y 2 )dp x + yp x dy) , (0,dz - ydx) , {d y ,dp y - p x dz) , (d Px ,-ydz - dx)} 
where we have computed: 



Since It = span{<9 Py , (1 + y 2 )d y — yp x d Px , dx + yd z }, we conclude from 

\ 1+y 2 )dy _ yp!cdvx Uj M = (1 + y 2 )(dp y -p x dz) - yp x (~ydz - dx) 
= (1 + y 2 )dp y -p x (dz - ydx), 

that the distribution 51 is equal to 3~C° . The constant of motion we have found above is a gauge constant of 
motion. 

7.6 Example: the vertical rolling disk 

In this subsection we shall determine the Nonholonomic Momentum Equations for the example of the vertical 
rolling disk studied in tj4.4l The Dirac structure for this nonholonomic system is given by 




(1 + y 2 )dPx +VPxdy 

dp y - p x dz 

-dy 

—ydz — dx. 




where we have computed 



fiR sin tfi fiR cos 4> 
dp 4, H p dx 




Pedy 



19 oj m 



l de + R cos 0d x +R sin <j>d y ^> M 



dp e + R cos (f> I dpg - 




/ii? sin tfi 




We consider again the three possible Lie groups: 
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1. The case G = R 2 (|Cantriin et alj (|l998t) 1 

Here, V° = spanjdp^, d<fi, dpg, d9} but there are no nontrivial horizontal symmetries and hence the 
distribution Dq is simply the whole bundle TM. We next compute CR. The vector bundle D n 3C 1 - is 
given in this case by 

span j(<9 ,dp ) , ^<9 Pe , - ^1 + ^y-^ d0^ , (3p^) -d<£) , ^<9e + Rcos(j)d x + Rsm(f>d y , ^1 + ^j-^ dp, 

To get this, we have added 

liR llR 

—j-pe cos 4>{dx — R cos </>d#) + — — sin 4>(dy — R sin 4>d9) (57) 

to id Pe lum and 

pR 

— — pg sin 4>{dx — R cos 0d0) + — — cos (f>(dy — R sin 4>d9) (58) 
to ig^uiM- This yields 3^ = "K° and thus the distribution 31° n V is equal to !H n V and hence trivial. 

2. case G = SE(2) l|Blochl (|2003h ) 

In this case, we have V° = span{dp , dp 9 , d8} and J{ n V = span{<9 }. We get 3i n V C (5£ n V) WM . A 
direct computation gives 

pR sin <f> pR cos 4> 
la^jf = dp0 H j p e da; pedy. 

Adding 

liR iiR 

— —pe{cos <fidy — s\n<pdx)d = —j-pg (cos</>(dy — i?sin0d6 | ) — sin0(da; — i?cos</>d6*)) S r(!K ) 

to this expression we see that T(D n (V © V )) is spanned by (d^,dp^) and thus the distribution 
T>c = kerdp^ is obviously integrable. For a value p € R of the map j?^, the reduced Dirac structure 
on M p is spanned by 

de,(l + ^-)d Pe ) and ( dpe ,-(l+ f -^-)d9 



The Nonholonomic Nocther Theorem yields a constant of motion but this constant doesn't arise from 
an element of g whose corresponding fundamental vector field is lying in r(Vn3^°): we have computed 
in g]that, in this case, 11 is the span of the three vector fields d^, d pg , and dg + Rcos<j>d x + Rsmcf>dy. 
Again, we have to add ([57]) to ig p u>m and (|58[) to Iq^usm m order to get sections of V°. Thus, we need 
the whole of !K in the construction of D n 'X ± . 



3. The case G = S 1 x K 2 (iBlochl (|2003l )l 



Here, we have V° = spanjdp^, dpg, d<j)} and "K n V is again one-dimensional: this time it is the span 
of the vector field d e + Rcoscj)d x + Rsin(j>d y . Thus, T(D n (V © V )) is spanned by 

ui? 2 

9e + Rcos4>d x + Rsin4>d v , (1 H — — )dp< 

and the distribution Dq = kcr{(l + li j—)dpg} is again integrable. For a value p G R of the map pg, 
the reduced Dirac structure on M p is spanned by {d^, dp^) and (d p . , —d<f>). 

In this case, we have 11 = spanjc^, d Ptt> , dg + Rcos <f>d x + i?sin <j>d y }. We get H = span{ ^pg (sin 0dx — 
cos ^dy)} from the considerations for the second case. Thus we have 31° = span{9^,, <9 P0 , d Pe , dg, cos <pd x + 
sin c/)dy} and our constant of motion pg really arises from a fundamental vector field lying in 51°. 
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4. The case G = SE(2) x S 1 (jBlochl (|2003t )) 

In this last case, we have V° = spanjdp^, dpg} and "K H V is this time two-dimensional: it is the span 
of the vector fields dg + R cos (j)d x + R sin (j)d y and d<p. Thus, T(D n (V © V )) is spanned by (dg + 
Rcos(f>d x + R sin (j)d y , (1 + i£ j-)dp ) and (<9^, dp^) and the distribution D G = ker{(l + l± j-)dp e , dp^} 
is integrable. Here, the reduced manifolds are single points. 

We have U — span{<9^, dg+R cos 4>d x +Rsincj)dy} and we get as above 31° = spanjS^, d p<t> ,d Pe ,dg, cos 4>d x + 
sin 4>d y } . 



7.7 Example: the Chaplygin skate 

We continue here the examples of H4.51 

The standard Chaplygin skate. We have seen that Vn!K = span{<9g, cos6d x + sin9d y }. If we choose the 
basis := (1,0, 0), £ 2 := (0, 1,0), £ 3 := (0,0, 1) of the Lie algebrase(2) we get £- f = dg-yd x +xd y , ^ 2 M = d x , 
and £ 3 f = d y . Hence, the sections + yS, 2 — x£ 3 and cos#£ 2 + sinf^ 3 6 r($j M ) are spanning sections of 
and the corresponding Nonholonomic Nocthcr equations are s cos 9dp y — s sin 8dp x and cos 9dp x + sin 9dp y 
respectively. Consequently, the two spanning sections — s^sinf^ 1 + (cos 2 9 — s^y sm0)£ 2 + sin#(cos# + 
s~ 1 x)£, 3 and s" 1 cos6'^ 1 + cos#(sm# + s~ 1 y)£ 2 + (sin 2 9 — s~ 1 xcos9)$ l 3 of q^ 1 lead to the nonholonomic 
Noether equations dp x and dp y respectively. Thus, Dg = U + V = span{<9 x , d y , dg} is found easily because 
T>g is the kernel of {dp x , dp y }. This is obviously integrable. The induced Dirac structure on a leaf f~ 1 (a, b) 
(where / is the projection on (p x ,p y )) of T>q is given by 

Df-i(a.b) — span{(cos^9 x + sin 9d y , 0), (dg, 0), (0, sin#dx — cos9dy)}. 

Here the reduced space M( a b ) is a single point. The reduced Dirac structure is hence trivial, as can also be 

seen from the formula - s 1(a,b> !^ ( ( ° ) ''' ) ' >+ — ^~^/G(a,b)- 

We could also consider the action of S 1 on M given by $ : 8 1 x M —> M, (a,8,x,y,p x ,p y ) t— > (9 + 
a, x cos a — y sin a, x sin a + y cos a,p x cos a—p v sina,^ sin a + p v cos a). Here, we would have V n Oi = {0} 
except for the points satisfying x = — sin 9 and y = — cos 9, so the condition that V fl IK has constant rank 
is not satisfied (we have also V + 34 ^ TM). 

If we consider the action of R 2 on M given by $ : M 2 xM — » M, (r,s,6,x,y,p x ,p y ) i— > (0, x+r, y+s,p x ,p y ), 
we have V = span{<9 x , <9j,}. Hence, V+34 = TM and Vn34 = {cosflc^+sin^c^} has constant rank on M. The 
distribution is given by D q = {kei{cos9dp x + sin9dp y } n 34) + V = span{sin#<9 Px — cos 9d Py , dg , d x , d y } . 
This vector bundle is not involutive and hence it is not integrable. Since U = kerjeos^dp^ + sin9dp y } n 34 = 
spanjsin #<9 Px — cos 9d Py , <9g , cos 9d x + sin #<9 a } , it is easy to see that 3? = 34° and hence K°nV = VnJ( = 
{cos 9d x + sm#<9 a }, which confirms the fact that the nonholonomic Noether equation yields in this case no 
constant of motion. 

The Chaplygin skate with a rotor on it. We have V n 34 — sp&n{d < f>,dg : cos9d x + sin9d y }. If we 
choose the basis ^ := (1,0,0,0), £ 2 := (0,1,0,0), £ 3 := (0,0,1,0), £ 4 := (0,0,0,1) of the Lie algebra 
R x se(2) we get = d$, £f f — dg — yd x + xd y , — d x , and = d y . Hence, the sections £ x , 
£ 2 +y£ 3 — x£ 4 , and cos#£ 3 4-sin6>i; 4 € F(g M ) are spanning sections ofg M and the corresponding Nonholonomic 
Noether equations are dp^, scos9dp y — ssin9dp x + dp r j )1 and cos^dp^ + s\n9dp Vl respectively. Thus, the 
three spanning sections f 1 , s _1 sin0^ 1 — s~ 1 sin0^ 2 + (cos 2 9 — s _1 ysin6')^ 3 + sin0(cos6* + s _1 x)^ 4 , and 
— cos^ 1 + s _1 cos(9£ 2 + cos9{suv9 + s~ 1 y)!; 3 + (sin 2 9 — s -1 ! cos #)£ 4 of lead to the nonholonomic 
Noether equations dp^, dp x , and dp y , respectively. Thus, Dq = IX + V = span{3 x , d y ,dg,d,p\ is found easily 
because Dq is the kernel of {dp^,dp x ,dp y }. This is obviously integrable. The induced Dirac structure on 
a leaf / _1 (a, 6, c) (where / is the projection on {P(j>,p x ,p y )) of Dg is given by 

Df-i(a t b,c) — span{(cos^9 x + sin#<9 y , 0), (<9g, 0), (d$, 0), (0, sin#dx — cos#dy)}. 
Here the reduced space Mi a b <c s is a single point. The reduced Dirac structure is hence trivial, as can also 
be seen from the formula ^~ 1 (". b - c )^^- b - c >) +JC( "' t '- c) /G( a ^, c )- 

Finally, note that in the last two examples, we have 3? = {0} and hence 51° n V = V. This is why we get 
in the first of the two examples the three constants of motion p x , p y , and pg belonging to the three elements 



48 



£ 2 , £ 3 , and of g and in the second example the constants p x , p y , pg, and p^ belonging to the four elements 
£ 3 > £ 4 : £ 2 j an d S, 1 of g. Note that in this case, the constancy of follows already from the existence of the 
constant section £ 1 of g M . 

Like in the previous example, the other symmetry groups of the system (the "^-symmetry" § , "the 4>- 
symmetry" S , S 1 X S , S 1 x M 2 , SE(2)) are not interesting for the method of reduction presented in this 
section. 



7.8 Example: the Heisenberg particle 



At last, we present an example where the reduced form is not closed. It can be found in lBlochl (|2003f ). The 
configuration space Q is R 3 with coordinates {x, y, z) subject to the constraint z — yx — xy. The Lagrangian 
on TQ is given by L(x, y, z, x, y, z) — \{x 2 + y 2 + z 2 ) and hence the Legendre transformation yields 

Px =x, p y = y, p z = z. 

For (x, y, z,p x ,p y ,p z ), we have p z = yp x — xp y . Hence, we have the global coordinates (x, y, z,p x ,p y ) for M 
and the 2-form lum is given by 

ujm = ds A dp x + dy A dp y + dz A d{yp x — xp y ) 

= dsA (dp x + pydz) + dy A (dp y - p x dz) + dz A (ydp x - xdp y ). 

The vector bundle !H is given by !H = kerjdz — ydx + xdy} — span{y9 z + d x , xd z — d y , d Py } and we 
compute 

iyd z +d x u M = y(ydp x - xdp y ) - yp y dx + yp x dy + (dp x + p y dz) 
i x d z -dyUM = x(ydp x - xdp y ) - xp y dx + xp x dy - (dp y - p x dz) 
id Px UM = -da; - ydz 
id Py UM = -dy + xdz. 
Thus, we get the smooth global spanning sections 
{ (yd z + d x , (y 2 + l)dp x - xydp y - yp y dx + yp x dy + p y dz) , (9 Px , -dx - ydz) , 
(xd z - d y , -(x 2 + l)dp y + xydp x - xp y dx + xp x dy + p x dz) , (d Py ,-dy + xdz) , (0,dz - ydx + xdy) } 
for the Dirac structure D. 

Consider the action <f> : R x Q — > Q of the Lie group G = R on Q, given by <f>(r, x, y, z) :— (x, y, z + r). This 
action obviously leaves the Lagrangian and the constraints invariant. The induced action $:GxM-t M 
is given by $(r, a;,y, z,p Xl p y ) := (x,y,z + r,p x ,p y ) and hence the vertical bundle in this example equals 
V = span{<9 z }. We get JCn V = {0} and hence U = %. The two methods of reduction (in g2]and ff^ i ea d 
in this case to the same result since the distribution T>g — U + V = CK + V = TM is trivially integrable with 
M as single leaf. The reduced Dirac structure -D re d on M with coordinates {x, y,p x ,p y ) is thus given by 

Dred = x / G 

= span { (d x , (y 2 + l)dp x - xydp y - yp y dx + yp x dy + p y (ydx - xdy)) , (d Px , -dx - y(ydx - xdy)) , 

(-d y , -(x 2 + l)dp y + xydp x - xp y dx + xp x dy + p x (ydx - xdy)) , (d Py , -dy + x(ydx - xdy)) } 

= span { (d x , (y 2 + l)dp x - xydp y + (yp x - xp y )dy) , , -(1 + y 2 )dx + xydy) , 

(-d y , -(x 2 + l)dp y + xydp x + (yp x - xp y )dx) , (d Py ,-(l + x 2 )dy + xydx) }. 

Note that this is the graph of the 2-form 

^rcd = (1 + y 2 )dx A dp x + (1 + x 2 )dy A dp y + (yp x - xp y )dx A dy - xy(dx A dp y + dy A dp x ). 

A direct computation shows that the determinant of o; re( j equals (1 + x 2 + y 2 ) 2 ^ on M which shows that 
the form w rG( j is nondegenerate. The equalities 

du> rcd (d x , d y , d Pm ) = - 2y and dw rod (d x , d y , d p J = 2x 

show that w ro d is not closed. 

Note also that in this example we have X = %° and hence S° nV = MnV = {0}. 
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A Push-down of distributions 



In t he main text of this paper we have u sed twice a rather technical proposition on "control" of distributions 
(see lNiimeiier and van der SchaftJ (Il990h 1. Due to its importance we present here a complete proof which is 
inspired by the work of Icheng and Tarn ( 19891 ). 

Recall that a distribution D C TM is said to be locally finite if for each point m G M there are an open 
neighborhood U C M of m and smooth vector fields X±, . . . , X r G X(U) such that at each point x G U we 
have span{Xi(x), . . . , X r (x)} — D(x). Note that locally finite distributions are necessarily smooth. 

Proposition A.l Let V C TM be an involutive vector subbundle of TM and D a locally finite smooth 
generalized distribution on M . Assume that 

[r(D),r(v)]cr(v + D). (59) 

Let X be a vector field on M satisfying 

[jr,r(V)]cr(v + D). (60) 

Then for each p G M there is an open set [/CM with p G U and smooth D -valued vector fields Z, Z\, . . . ,Z r 
on U satisfying 

(i) D(g) = span^te), . . . , Z r {q)} for all qeU, 

(ii) [Z u r(V)] C T(V) on U for alii = 1, . . . , r, anrf 

(ih) [x + z, r(v)] c r(v) on u. 

Proof: Let n := dimM and fc := dimV(x), for a; G M. Since the vector subbundle V is involutive, it is 
integrable by the Frobenius Theorem and thus any p G M lies in a foliated chart domain U\ described by 
coordinates (x , ...,x n ) such that the first k among them define the local integral submanifold containing 
p (see ^2.21 for a review of these notions). Thus, for any q £ U\ the basis vector fields d x i, . . . , d x k evaluated 
at q span V(q). 

Because D is locally finite, we can find on a sufficiently small neighborhood U C U\ of p smooth vector 
fields Xi, . . . , X r spanning CD, i.e., for all q G U we have 

■D(q)=spaxi{X 1 (q),...,X r (q)}. 

Write, for i = 1, . . . , r 

n 

Xi = ^ Xjd x j, 
i=i 

with Xf local smooth functions defined on U for j = 1, . . . , n. By hypothesis (|59|) we get for alH = 1, . . . , r 
and I = 1, . . . , k: 

n 

d xl (x^ ■= [d xl ,x i \ = Y J d x > (xi)d xi g r(v + d). 

Hence we can write 

n k r 

d x i (Xt) Y^n, [x()d xj = x 4^^ + E "/.^ 

with A\ t G C°° (C7) for i = X, . . . ,r, j,l = 1, . . . ,k and Bfj G C°°(U) for i,j = 1, . . . , r, I = 1, . . . , k. Set 

n 

for i = 1 , . . . , r and get 

n r 

- [d*hXi] - E ^ = E4^- 
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We rewrite this system as 

(d x i(X 1 ),...,d x ,(X r )) = (x 1 ,...,x r ^B l , 

where Bi — [B 3 H ] is an r x r matrix with entries B 3 U S C°°(U), i, j — 1, . . . , r. 

Now fix j e {1, . . . , fc}, think of x 3 as a time variable and all the other x l as parameters, and consider the 
following ordinary differential equation 

d xl Y = BjY. (61) 

Let Y 3 ,..., Yj be r linearly independent solutions of . Set Wj = (Y/ , . . . ,Yj) which is an invertible 
matrix. Since the rows of (Xi, . . . ,X r ) (where we think of this as a (n — fc) x r-matrix with columns 
(Xj fe+1 , . . . , X") T for i = 1, . . . , r) are also solutions of (f6Tj) . we know that there exists a r x (n — fc) matrix 
Lj with C°° entries such that 

(X v r : u,/.,. ./ l fc 

where Lj is independent of Xj (which is the independent variable of differential equation (|61|) V Therefore, 
we have 

WxLx = W 2 L 2 = ■■■ = W k L k . (62) 

Because W% is nonsingular, we have L 2 — W% W\L\. Set x 2 = on both sides of this equation and 
get L 2 = WfVi 1^=0X1(0 ,x 3 , . . . ,x n ) since L 2 is independent of x 2 . The matrix H 2 := W 2 1 Wi\ X2=0 is 
smooth and nonsingular and L 2 = H 2 Li(0, X3, . . . ,x n ). Recursively, assume Hi is a well-defined smooth 
nonsingular matrix and Li{x) = HiL\{Q, . . . , 0, 2^+1, . . . , x„). Using (|62|) . get 

L !+ i(x) = W^WifliLifO, . . .,0,3^.1, ...,i„) and let H l+l := Wr^WiHi\ Xi+1 =o. 

Since Li+\ is independent of Xi+i, we have Li + i(x) = Hi + iLi(0, . . . ,0,Xi +2 , . . . ,x n ). Finally, get Hj~ 
and Lk{x) = HkLi(0, . . . ,0, Xk+i, ■ ■ ■ ,x n ). Define the smooth nonsingular matrix H := WkHk, and L := 
Li(0, . . . , 0, Xk+i, ■ ■ ■ , x n ) which is independent of x%, . . . , Xk- Then (Xi, . . . , X r ) T = HL. Define 

B = {H T )- 1 = {H- l ) T . (63) 

Then (Xi, . . . , X r )B — L T is independent of x%, . . . , Xk- This yields 

^,((li,...,X r )B)J =0 (64) 

for / = 1, . . . , fc and i = 1, . . . ,r. In this formula, we denote by {{Xi, . . . , X r )B)i the ith column of the 
matrix (Xi, . . . ,X r )B, considered as the representation of a local vector field in the basis {d x k+i, . . . ,d x n}, 
i.e. if 

(Xi, . . . ,X r )B — [Cjl]j—k+l,...,n 
1=1, ...,r 

we write {{Xi, . . . , X r ) ■ B)i for the smooth vector field 

n 

{{X 1 ,...,X r )B) i := ( 'j- n - ■ ( 65 ) 
j=k+l 

Let Z\, . . . , Z r be the local vector fields defined by Zi = {{X\, . . . , X r )B) i for i = 1, . . . ,r, where again, if 
we have 

(Xi, . . . ,X r )B = [Cji]j=i : ... t n 
1=1, ...,r 

we write ((Xi, . . . , X r )B) i for the smooth vector field 

n k n 

^ \ Cjjd x j = y * Cjjd x j + y y Cjid x j . 

We get (with the identification (|65|) above) 

Z i = £ i + ((X 1 ,...,X r )B)i 
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where 



6 y^C,,'),. < l(V;. 

j=l 



Thus, we have for I = 1, . . . , k, 



[d x i , Zi] = [d x i , + [a,, , ((Xj , . . . , x r ) ■ b) ] W [a a . , 6] + o e r(v) 



for i = 1, . . . , r. Hence, if we write an arbitrary section r/ G r(V) as 

k 

3=1 

with smooth local functions r/i, . . . ,r]k, we get for i = 1, . . . , r: 

k 



foZi] 



^a^ i Zi 



Vry^^ + V^fe]^ er(V). 



Thus, since by construction, Z\, . . . , Z r also span Donf these vectors fields satisfy the first two statements 
of the proposition. 

For the third statement, note that if X = X)j l =i °^d x j with C°°-functions a 1 , . . . ,a n , (|60|) yields 

n 

[d x t,X] =Y / d x i(a j )d xi er(v + D) 

3=1 

for I = 1, .. ., fc. This leads to X)?=i ®x l { a ^)®xj — Ylj=i a \®xi + Sj=i /^f-^j w i tn C°° -functions . . . ,/3[, 
C°°-functions a}, . . . , af , and Xl, . . . , X r as above. Hence, if wc define 

n 

x '■= a?d xi , 

we get for Z = 1 ..... A; 



j=k+l ] = 1 

where Pi is the (r x l)-matrix with the entries fif, . . . ,/3J. From the definition (f6"5)) of the matrix B we get 
for j,l = l,...,k: 



. . .,l r )5 [9 x! , (tf T &)] = [a x i, (*i, . . .,X r )Bif T /^] = (Xr, . . . ,l r )/3 J 

- [a,, , [a,, , !]] = , [d xl ,x]] = (x 1 ,..., x r )B [d xi , {H T p t )] 

where we have used the Jacobi identity and [d x i , d x j] = in the third identity above. 
Define now the (r x l)-matrix with C°°-entries 



7 = (71) ■ ■ ■ ;7r) T 



B 



,T,X 



Jo 

+ •••+/ (H T p 1 )(T,0,...,0,x k+1 ,...,x n )dT 



(66) 
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Then for I = 1, . . . , k a computation using (|66[) and the definition of 7 leads to 

d xh (x + . . . ,X r h)] = (X ± , . . .,X r )0i - {X X , . . .,X r )Pi = 0. 
Thus the desired vector field satisfying the third condition in the statement of the proposition is 

r 

z = (x u ..., x r ) 7 = J2 Xuik e r(B), 

which we can also write as: 

r r 

Z = {X 1 ,..., A r ) 7 + (X 1 ,..., l r ) 7 = x klk + J2 



(67) 



fc=i 



fe=i 



where X := £j =1 and = £*=i *i Qrf G r ( v ) for i = 1, . . . , r. 

Indeed, for any £ = 1, . . . , k, since d x i , . . . , c^*.- is a basis of the space of sections of V over U, we get 



[X + Z,d xl ] = 



X + Y,Xklk,d x 



fe=i 



fc=i 



eg 



X + ^2 x k7k, d x i 



k=l 



+ e r(V) 



since it follows from the definition of X and Xj, i = 1, . . . , r, that X + J2k=i x k7k G r(V). As in the first 
part of the proof, we get [X + Z, T(V)] C r(V) on J7 and the proposition is proved. □ 

Proposition A. 2 Assume that the Lie group G acts freely and properly on the smooth manifold M , let 
M := M/G be the orbit space, and denote by n : M — > M the principal G-bundle projection. Let V C TM 
be the vertical subbundle of this action. If X £ X(M) is a smooth vector field satisfying [A, r(V)] C F(V), 
then there exists X G X(M) such that X ~ n X . 

Proof: If necessary, shrink the domain U o f definition of X such th at U is contained in a tube for the 
action of G on M (see, e.g.. IPalaid (|l96lh or lOrtega and Ratiul (|2004 ) Theorem 2.3.28). Hence, since the 
action is free, we can find smooth coordinates {gi, . . . , g k , x\, . . . , x n -k} on U such that the projection map 
7r is given in this chart by 

7r : (gi, ...,g k ,xi,.. .,x n - k ) [x\, . . -,x n - k ) 
and the vertical space V is spanned by the sections d gi , . . . , d 9h . Write the smooth vector field X as 

k n—k 

X = J2 a i d Bi + zZ ''>'>< 
J'=l i=i 

with smooth functions a\, . . . , a k and 61, ... , b n - k defined on U. For / = 1, . . . , k we get 

k k n—k n—k 

j'=i j=i i=i i=i 

fe n — fc 

3=1 3=1 

Since this is an element of r(V), we conclude that d gi (bj) = for I = l,...,k and j = 1, . . . ,n — k. 
This means that the functions 61, . . . ,b n — k are independent of the variables gi,---,g k and we can define 
A" = Y^jZi bjd Xj G £(M). We have then for all p = (gx, . . . , g k , X\, . . . , x n - k ) G U and p = ir(p) = 
(x%, . . .,x n -ie) G 7r(f7): 

/ k n—k \ n—k 

T p tt (X(p)) = T p 7r £ a ^9 gj l P + E 6 i(P)^i I* = E 6 i(P)^ I? = ^(P)> 



as required. 



□ 
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Corollary A. 3 The local vector field X £ X(M) satisfies [X, V] £ T(V) for every V £ T(V) if and only if 
there exists X G G X{M) G such that X - X G £ T(V). 

PROOF: We continue using the notations in the proof of Proposition IA.2I Note that X G := Y^j=i bjd Xj 
is G-equivariant and that X — X G = Y^j=i a j®gj e r(V). Conversely, let X £ X(M) be such that there 
exists X G £ X(M) G satisfying W := X - X G £ T(V). Let us show that for any V £ T(V) we have 
[X,V] = [X G ,V] + [W,V] £ T(V). Since V is involutive, we conclude that [W, V] £ T(V). To see that 
[X G , V] £ T(V) write X G — X)j=i tyj®xj + Sj=i a 'jd§j an d n °te that since X G is G-equivariant, we have 
®Bi ( a 'i) — ®9j = ior j = 1, . . . ,k , i = 1, . . . ,k and / = 1, . . . , n — k. For any local section V of r(V), 
write V = X)j=i v i®9j an< ^ compute: 

h k I n-k k \ k 

[X G , V]=Y / X G { Vl )d gi {b' 3 )d X] + d gi {a' 3 )d g3 = J2 X G { Vl )d gi £ T(V). □ 

i=i i=i \j=i j=i I i=i 
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